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Abstract 

We give a nearly optimal sublinear-time algorithm for approximating the size of a minimum vertex 
cover in a graph G. The algorithm may query the degree deg(u) of any vertex v of its choice, and for 
each 1 < i < deg(i;), it may ask for the neighbor of v. Letting VCopt(G) denote the minimum 
size of vertex cover in G, the algorithm outputs, with high constant success probability, an estimate 
VC(G) such that VCopt(G) < VC(G) < 2VCopt(G) + en, where e is a given additive approximation 
parameter. We refer to such an estimate as a (2, e)-estimate. The query complexity and running time 
of the algorithm are 0{d ■ poly(l/e)), where d denotes the average vertex degree in the graph. The 
best previously known sublinear algorithm, of Yoshida et al. {STOC 2009), has query complexity and 
running time 0{d'^/e'^), where d is the maximum degree in the graph. Given the lower bound of 
(for constant e) for obtaining such an estimate (with any constant multiplicative factor) due to Parnas and 
Ron {TCS 2007), our result is nearly optimal. 

In the case that the graph is dense, that is, the number of edges is 8(?i^), we consider another model, 
in which the algorithm may ask, for any pair of vertices u and v, whether there is an edge between 
u and V. We show how to adapt the algorithm that uses neighbor queries to this model and obtain an 
algorithm that outputs a (2, e) -estimate of the size of a minimum vertex cover whose query complexity 
and running time are 0{n) ■ poly(l/e). 
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1 Introduction 



Computing the size of a minimum vertex cover in a graph is a classic NP-hai^d problem. However, one can 
approximate the optimal value of the solution to within a multiplicative factor of two, via a neat and simple 
algorithm whose running time is linear in the size of the graph (this algorithm was independently discovered 
by Gavril and Yanakakis, see e.g. IIPS98I ). 

A natural question is whether it is possible to obtain a good approximation for the size of the optimal 
vertex cover in time that is sublinear in the size of the graph G. Since achieving a pure multiplicative 
approximation is easily seen to require linear- time, we focus on algorithms that compute an estimate VC(G) 
such that with high constant probability, VCopt(G') < VC(G) < a • VCopt(G) + en, for a > 1 and 
< e < 1, where VCoptlG) denotes the minimum size of a vertex cover in G. We refer to such an estimate 
VC(G) as an (a, e)-estimate of VCopt(G). Observe that in the special case when the vertex cover is very 
large, namely VCopt(G') = Q{n) (which happens for example when the maximum degree and the average 
degree are of the same order), then an (a, e)-estimate yields an (a + 0(e))-multiplicative approximation. 

Since an algorithm with complexity sublinear in the size of the graph cannot even read the entire graph, 
it must have query access to the graph. In this work we consider two standard models of queries. In the first 
model, the algorithm may query the degree Aeg{v) of any vertex v of its choice, and it may also query the 
z*^ neighbor of v (where the the order on the neighbors is arbitrary). In the second model, more appropriate 
when the graph is stored as an adjacency matrix, the algorithm can check in a single query whether there 
is an edge between two vertices v and w chosen by the algorithm. We focus on the first model, but we 
eventually show that our algorithm can be modified to work in the second model as well. 

Previous work. The aforementioned question was first posed by Pamas and Ron IIPR07i . who showed how 
to obtain a (2, e)-estimate (for any given additive approximation parameter e) in time d'^(i°S'^/'^' ), where d 
is the maximum degree in the graph. The dependence on the maximum degree d can actually be replaced 
by a dependence on d/e, where d is the average degree in the graph IIPR07I . The upper bound of d'^(i°srf/^') 
was significantly improved in a sequence of papers ||MR09[ |NQ08[ IYYI09II . where the best result due to 
Yoshida, Yamamoto, and Ito IIYYI09I (who analyze an algorithm proposed by Nguyen and Onak IINO08II ) is 
an upper bound of 0{d'^ /e^). Their analysis can also easily be adapted to give an upper bound of 0((?/e^) 
for graphs with bounded average vertex degree d. 

On the negative side, it was also proved in IIPR07I that at least a linear dependence on the average 
degree, d, is necessary. Namely, U.{d) queries are necessary for obtaining an (a, e)-estimate for any a > 1 
and e < 1/4, provided that d = 0{n/a), and in particular this is true for a = 2. We also mention that 
obtaining a (2 — 7, e)-estimate for any constant 7 requires a number of queries that grows at least as the 
square root of the number of vertices IIPR07I due to Trevisan]. 

Our Result. In this work we describe and analyze an algorithm that computes a (2, e) -estimate of 
VCopt(G) in time 0{d) ■ poly(l/e). Note that since the graph contains dn/2 edges, our running time 
is sublinear for all values of d. In particular, for graphs of constant average degree, the running time is inde- 
pendent of the number of nodes and edges in the graph, whereas for general graphs it is bounded by at most 
the square root of the number of edges. In view of the aforementioned lower bound of Q{d), our algorithm 
is optimal in terms of the dependence on the average degree up to a polylogaiithmic factor. Since our algo- 
rithm builds on previous work, and in particular on the algorithm proposed and analyzed in IINO08[rYYI09ll . 
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we describe the latter algorithm first0 We refer to this algorithm as Approx-VC-I. 

The Algorithm Approx-VC-I. Recall that the size of a minimum vertex cover is lower-bounded by the 
size of any (maximal) matching in the graph, and is upper-bounded by twice the size of any maximal match- 
ing. This is indeed the basis of the aforementioned factor-two approximation algorithm, which runs in 
linear-time. To estimate the size of an arbitrary such maximal matching, the algorithm follows the sampling 
paradigm of Parnas and Ron IIPR07i That is, the algorithm Approx-VC-I selects, uniformly, independently 
and at random, /e^) edges. For each edge selected, it calls a maximal matching oracle, which we 
describe momentarily, where the oracle's answers indicate whether or not the edge is in the maximal match- 
ing M., for some arbitrary maximal matching M. (that is not a function of the queries to the oracle). The 
algorithm then outputs an estimate of the size of the maximal matching A4 (and hence of a minimum vertex 
cover) based on the fraction of sampled edges for which the maximal matching oracle returned a positive 
answer. The number of sampled edges ensures that with high constant probability, the additive eiTor of the 
estimate is 0{{e/d)m) < en, where m is the number of edges in the graph. 

The main idea of the algorithm follows the idea suggested in IINO08II which is to simulate the answers 
of the standard sequential greedy algorithm. The greedy algorithm supposes a fixed ranking (ordering) of 
the edges in G, which uniquely determines a maximal matching as follows: proceeding along the edges 
according to the order determined by the ranking, add to the matching each edge that does not share an end- 
point with any edge previously placed in the matching. The maximal matching oracle essentially emulates 
this procedure while selecting a random ranking "on the fly", but is able to achieve great savings in running 
time by noting that to determine whether an edge is placed in the matching, it is only necessary to know 
whether or not adjacent edges that are ranked lower than the cun^ent edge have been placed in the matching. 
Namely, given an edge {u, v), it considers all edges that share an endpoint with (n, v) and whose (randomly 
assigned) ranking is lower than that of {u,v). If there are no such edges, then the oracle returns TRUE. 
Otherwise it performs recursive calls to these edges, where the order of the calls is according to their ranking. 
If any recursive call is answered TRUE, then the answer on {u, v) is FALSE, while if all answers (on the 
incident edges with a lower rank) is answered FALSE, then the answer on (u, v) is TRUE. 

Though the con^ectness of the above algorithm follows directly from the correctness of the greedy al- 
gorithm, the query and runtime analysis are more difficult. The analysis of IINO08I is based on a counting 
argument that shows that it is unlikely that there is a long path of recursive calls with a monotone decreas- 
ing set of ranks. Their bound gives a runtime that is independent of the size of the graph, but exponential 
in the degree d. However, using that the algorithm recurses according to the smallest ranked neighbor, 
IIYYI09II give an ingenious analysis that bounds by 0{d) the total number of expected recursive calls when 
selecting an edge uniformly at random, and when selecting a ranking uniformly at random. This is what 
allows IIYYI09II to obtain an algorithm whose query complexity and running time are 0{d'^ /e^). 

Our Algorithm. In what follows we describe an algorithm that has almost linear dependence on the max- 
imum degree d. The transformation to an algorithm whose complexity depends on the average degree d is 
done on a high level along the lines described in IIPR07II . We first depart from Approx-VC-I by performing 
the following variation. Rather than sampling edges and approximating the size of a maximal matching by 

' Yoshida et aL IYYI09I actually analyze an algorithm for approximating the size of a maximal independent set. They then apply 
it to the line graph of a given graph G, so as to obtain an estimate of the size of a maximal matching, and hence of a minimum 
vertex cover (with a multiplicative cost of 2 in the quality of the estimate). For the sake of simplicity, we describe their algorithm 
directly for a maximal matching (minimum vertex cover). 
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calling the maximal matching oracle on the sampled edges, we sample vertices (as in IIPR07II ). and we call 
a vertex cover oracle on each selected vertex v. The vertex cover oracle calls the maximal matching oracle 
on the edges incident to v according to the order induced by their ranking (where the ranking is selected 
randomly). Once some edge returns TRUE, the vertex cover oracle returns TRUE, and if all incident edges 
return FALSE, the vertex cover oracle returns FALSE. By performing this variation we can take a sample of 
vertices that has size e(l/e2) rather thaii 6(^2 /e^). 

Unfortunately, the analysis of IIYYI09i is no longer applicable as is. Recall that their analysis bounds 
the expected number of recursive calls to the maximal matching oracle, for a random ranking, and for a 
randomly selected edge. In contrast, we select a random vertex and call the maximal matching oracle on 
its (at most d) incident edges. Nonetheless, we are able to adapt the analysis of IIYYI09II and give a bound 
of 0{d) on the expected number of recursive calls to the maximal matching oracle, when selecting a vertex 
uniformly at randomjl 

As a direct corollary of the above we can get an algorithm whose query complexity and running time 
grow quadratically with d. Namely, whenever the maximal matching oracle is called on a new edge {u, v), 
the algorithm needs to perform recursive calls on the edges incident to u and v, in an order determined by 
their ranking. To this end it can query the 0{d) neighbors of u and v, assign them (random) rankings, and 
continue in a manner consistent with the assigned rankings. 

To reduce the complexity of the algorithm further, we show a method that for most of the edges that 
we visit, allows us to query only a small subset of adjacent edges. Ideally, we would like to make only 
k queries when k recursive calls are made. One of the problems that we encounter here is that if we do 
not query all adjacent edges, then for some edge (u, v), we could visit a different edge incident to u and a 
different edge incident to v and make conflicting decisions about the ranking of (u, v) from the point of view 
of these edges. This could result in an inconsistent execution of the algorithm with results that are hard to 
predict. Instead, we devise a probabilistic procedure, that, together with appropriate data structures, allows 
us to perform queries almost "only when needed" (we elaborate on this in the next paragraph). By this we 
mean that we perform queries only on a number of edges that is a poly(log(d/e)) factor larger than the total 
number of recursive calls made to the maximal matching oracle. We next discuss our general approach. 

As in previous work, we implement the random ranking by assigning numbers to edges independently, 
uniformly at random from (0, 1] (or, more precisely, from an appropriate discretization of (0, 1]). For each 
vertex we keep a copy of a data structure that is responsible for generating and assigning random numbers to 
incident edges. For each vertex, we can ask the corresponding data structure for the incident edge with the 
i^^ lowest number. How does the data structure work? Conceptually, the edges attached to each vertex are 
grouped into "layers", where the edges in the first layer have random numbers that are at most 1/d, the edges 
in layer i > 1 have random numbers in the range 2^^^ jd to T jd. The algorithm randomly chooses edges 
to be in a layer for each vertex, one layer at a time, starting with the lowest layer. Each successive layer is 
processed only as needed by the algorithm. If the algorithm decides that an edge is in the current layer, then 
it picks a random number for the edge uniformly from the range associated with the layer. In particular, it 
is possible to ensure that the final random number comes from the uniform distribution on (0, 1]. In order 
to make sure that the same decision is made at both endpoints of an edge (n, v), the data structures for u 
and V communicate whenever they want to assign a specific random number to the edge. The algorithm 

^We note that it is actually possible to save one factor of d without changing the algorithm Approx-VC-I by slightly refining 
the probabilistic analysis. This would reduce the complexity of Approx-VC-I to cubic in d. 

""To be more precise, we first give a bound that depends on the ratio between the maximum and minimum degrees as well as on 
the average degree, and later we show how to obtain a dependence on d (at an extra cost of 1/e) by slightly modifying the input 
graph. 
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works in such a way so that vertices need query their incident edges only when a communication regarding 
the specific edge occurs. Our final algorithm is obtained by minimizing the amount of communication 
between different data structures, and therefore, making them discover not many more edges than necessary 
for recursive calls in the graph exploration. 



Other Related Work. For some restricted classes of graphs it is possible to obtain a (1, e) -estimate of 
the size of the minimum vertex cover in time that is a function of only e. Elek shows that this is the case 
for graphs of subexponential growth MElelOI . For minor-free graphs, one obtains this result by applying the 
generic reduction of Parnas and Ron IIPR07I to local distributed algorithm of Czygrinow, Harickowiak, and 
Wawrzyniak IICHW08II . Both of these results ai^e generalized by Hassidim et al. IIHKNQ09I to any class of 
hyperfinite graphs. In particular, for planar graphs, they give an algorithm that computes a (1, e) -estimate 
in 2P°^y(^/'^) time. While the running time must be exponential in 1/e, unless there exists a randomized 
subexponential algorithm for SAT, it remains a neat open question whether the query complexity can be 
reduced to polynomial in 1/e. 

For bipartite graphs, a (1, en)-estimate can be computed in dP^^I'^^) time. This follows from the relation 
between the maximum matching size and the minimum vertex size captured by Konig's theorem and fast 
approximation algorithms for the maximum matching size IINQ08[|YYI09II . 

Ideas similar to those discussed in this paper are used to construct sublinear time estimations of other 
parameters of sparse combinatorial objects, such as maximum matching, set cover, constraint satisfaction 
IINO08I |YYI09[ lYosl II . In the related setting of property testing, sublinear time algorithms are given for 
testing any class of graphs with a fixed excluded minor and any property of graphs with a fixed excluded 
minor IICSS09. BSSOB. lEleTOllHKNOOQilNSTTl . 

There are also other works on sublinear algorithms for various other graph measures such as the mini- 
mum weight spanning tree IICRT05I ICS091 ICEF+05l . the average degree IIFei06[lGR08l . and the number of 
stars HGRSIOL 



2 The Oracle-Based Algorithm 

Let G = {V, E) be an undirected graph with n vertices and m edges, where we allow G to contain parallel 
edges and self-loops. Let d denote the maximum degree in the graph, and let d denote the average degree. 
Consider a ranking it : E ^ [m] of the edges in G = {V, E). As noted in the introduction, such a ranking 
determines a maximal matching M'^(G). Given M'^(G), we define a vertex cover C^{G) as the set of all 
endpoints of edges in M^{G). Therefore, VCopt < |C'^(G)| < 2VCopt, where VCopt is the minimum size 
of a vertex cover in G. We assume without loss of generality that there ai^e no isolated vertices in G, since 
such vertices need not belong to any vertex cover. We shaU use the shorthand M'^ and for M^{G) and 
C'^(G), respectively, when G is clear from the context. 

Assume we have an oracle VO'^ for a vertex cover based on a ranking vr of the edges, where VO'^(u) = 
TRUE if w G C'^(G), VO'^(u) = FALSE Otherwise. The next lemma follows by applying an additive Chernoff 
bound. 

Lemma 2.1 For any fixed choice o/tt, let C = C'^(G). Suppose that we uniformly and independently select 
s = Q{\) vertices v from V. Let t be a random variable equal to the number of selected vertices that 
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belong to C. With high constant probability, 

ICI — en < - • n < ICI + en . 

s 

Algorithm[Tl provided below, implements an oracle VO'^, that given a vertex v, decides whether v € C^. 
This oracle uses another oracle, MO'^ (described in Algorithm ^ that given an edge e, decides whether 
e G M'^. Both oracles can determine 7r(e) for any edge e of their choice. The oracle MO'^ essentially 
emulates the greedy algorithm for finding a maximal matching (based on the ranking vr). We assume that 
once the oracle for the maximal matching decides whether an edge e belongs to M'^ or not, it records this 
information in a data structure that allows to retrieve it later. By Lemma lZTl if we perform 0(l/e^) calls to 
VO'^, we can get an estimate of the size of the vertex cover up to an additive error of (e/2)n, and hence 
we can obtain a (2, e)-estimate (as defined in the introduction) of the size of a minimum vertex cover in G. 
Hence our focus is on upper bounding the query complexity and running time of the resulting approximation 
algorithm when vr is selected uniformly at random. 



Algorithm 1: An oracle \0^{v) for a vertex cover based on a ranking tt of the edges. Given a vertex v, the 
oracle returns TRUE if v E and it returns FALSE otherwise. 

1 Let ei, . . . , ef be the edges incident to the vertex v in order of increasing rank (that is, 
7r(ei+i) > 7r(ei)). 

2 for i = 1, . . . , i do 

3 if MO''(ei) = TRUE then 

4 |_ return TRUE 

return FALSE 



Algorithm 2: An oracle MO'^(e) for a maximal matching based on a ranking tt of the edges. Given an edge 
e, the oracle returns TRUE if e G and it returns FALSE otherwise. 

1 if MO'^(e) has already been computed then 

2 1^ return the computed answer. 

3 Let ei , . . . , ef be the edges that share an endpoint with e, in order of increasing rank (that is, 

7r(ej+i) > 7r(ej)). 

4 i ^ 1. 

5 while 7r(ej) < 7r(e) do 

6 if MO'^{ei) = TRUE then 

7 |_ return FALSE 

8 else 

9 |_ i ^ i + 1. 
10 return TRUE 



We start (in Section O by bounding the expected number of calls made to the maximal-matching oracle 
yiO^ in the course of the execution of a call to the vertex-cover oracle VO'^. This bound depends on 
the average degree in the graph and on the ratio between the maximum degree and the minimum degree. 
A straightforward implementation of the oracles would give us an upper bound on the complexity of the 
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algorithm that is a factor of d larger than our final near-optimal algorithm. In Section |4] we describe a 
sophisticated method of simulating the behavior of the oracle MO'^ for randomly selected ranking vr, which 
is selected "on the fly". Using this method we obtain an algorithm with only a polylogarithmic overhead (as 
a function of d) over the number of recursive calls. Thus, for graphs that are close to being regular, we get 
an algorithm whose complexity is 0{d/e'^). In Section [5] we address the issue of variable degrees, and in 
particular, show how to get a nearly-linear dependence on the average degree. 

3 Bounding the Expected Number of Calls to the Maximal-Matching Oracle 

For a ranking vr of the edges of a graph G and a vertex v G V, let A^(vr, v) = NciT^, v) denote the number 
of different edges e such that a call MO'^(e) was made to the maximal matching oracle in the course of the 
computation of YO^{v). Let 11 denote the set of all rankings vr over the edges of G. Our goal is to bound 
the expected value of A^(7r, v) (taken over a uniformly selected ranking vr and vertex v). We next state our 
first main theorem. 

Theorem 3.1 Let G be a graph with m edges and average degree d, and let the ratio between the maximum 
degree and the minimum degree in G be denoted by p. The average value of N[tt, v) taken over all rankings 
vr and vertices v is 0{p ■ d). That is: 

m! n ^ — ' ^ — ' 

Tren fev 

If the graph is (close to) regular, then the bound we get in Theorem 13. II is 0{d) = 0{d). However, for 
graphs with varying degrees the bound can be Q{d'^). As noted previously, we later show how to deal with 
variable degree graphs without having to pay a quadratic cost in the maximum degree. 

As noted in the introduction, our analysis builds on the work of Yoshida et al. IIYYI09II . While our 
analysis does not reduce to their^, it uses many of their ideas. We start by making a very simple but useful 
observation about the maximal matching oracle MO'^ (Algorithm |2l), which follows immediately from the 
definition of the oracle. 

Observation 3.2 For any edge e, consider the execution of MO'^ on e. If in the course of this execution, 
a recursive call is made to MO'^ on another edge e! , then necessarily vr(e') < vr(e). Therefore, for any 
consecutive sequence of (recursive) calls to edges e^, . . . , e\, TT{ee) > vr(e^_i) > . . . > vr(ei). 

In order to prove Theorem 13.11 we introduce more notation. For any edge e G i?, we arbitrarily label its 
endpoints by Va{e) and vi,{e) (where if e is a self-loop then Va{e) = Vb{e), and if e and e' are parallel edges, 
then Va{e) = Va{e') and Vb{e) = Vb{e')). For a ranking vr and an index k, let vr^ denote the edge e such that 
vr(e) = k. 

We say that an edge e is visited if a call is made on e either in the course of an oracle computation of 
YO^ {va{e)) or YO'^ {vb{e)) (that is, as a non-recursive call), or in the course of an oracle computation of 

''indeed, we initially tried to find such a reduction. The main difficulty we encountered is that the vertex cover oracle, when 
executed on a vertex v, performs calls to the maximal matching oracle on the edges incident to v until it gets a positive answer (or 
all the incident edges return a negative answer). While the analysis of IYYI09I gives us an upper bound on the expected number 
of recursive calls for a given edge, it is not clear how to use such a bound without incurring an additional multiplicative cost that 
depends on the degree of the vertices. 
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MO'^(e') for an edge e' that shares an endpoint with e (as a recursive call). For a vertex v and an edge e, let 
X'^{v, e) = Xq{v, e) equal 1 if e is visited in the course of the execution of VO'^(u). Using the notation 
just introduced, we have that 

Ni-K^v) = Y^X^v.e) . (2) 

eS-B 

Observation 3.3 Let e = {v, u). If X^{v, e) = 1, then for each edge e' that shares the endpoint v with e 
and for which 7r(e') < 7r(e) we have that MO'^(e') = FALSE. 

To verify Observation 13.31 assume, contrary to the claim, that there exists an edge e' as described in the 
observation and MO'^(e') = TRUE. We first note that by Observation 13. 2[ the edge e cannot be visited in 
the course of an execution of MC^ on any edge e" = {v, w) such that vr(e") < 7r(e) (and in particular this 
is true for e" = e'). Since VO'^(u) performs calls to the edges incident to v in order of increasing rank, 
if MO'^(e') = TRUE, then VO'^(w) returns TRUE without making a call to MO'^(e). This contradicts the 
premise of the observation that X'^{v, e) = 1. 

The next notation is central to our analysis. For k G [m] and a fixed edge e: 

Xk{e) = J2 (^"K(vrfc), e) + X^Vh{7Tk), e)) . (3) 
Tren 

That is, Xk{e) is the total number of calls made to the maximal matching oracle on the edge e when summing 
over all rankings vr, and performing an oracle call to the vertex-cover oracle from one of the endpoints of 
vTfc. Observe that 

m 

Y^Xkie) = j;deg(i;)-X'^(t;,e) (4) 
k=i -n-eiivev 

where deg(f ) denotes the degree of v in the graph, and for simplicity of the presenation we count each 
self-loop as contributing 2 to the degree of the vertex. We next give an upper bound on Xfc(e). 

Lemma 3.4 For every edge e and every k € [m]: 

Xk{e) < 2(m - 1)! + {k - 1) ■ {m - 2)1 ■ d . (5) 

In order to prove Lemma l34l we establish the following lemma. 

Lemma 3.5 For every edge e and every k G [m — 1]: 

Xk+i{e) - Xkie) < (m - 2)1 ■ d . (6) 

Before proving Lemma [3751 we show how Lemma l34l easily follows from it. 

Proof of Lemma l374t We prove the lemma by induction on k. For the base case, k = 1, 

Xi{e) = (^X^va{7ri),e)+X^vt{7ri),e)) . (7) 

By the definition of the vertex-cover oracle, when starting from either fa(vri) or from ^^(vri), only a single 
call is made to the maximal matching oracle. This call is on the edge vri, which returns TRUE without making 
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any further calls, because all edges (that share an endpoint with tti) have a larger rank. This implies that if 
e = TTi, then X'^ (tti ) , e) = X'^ {vb{iTi) , e) = 1, and otherwise X'^{va{'rTi),e) = X'^{vb{7ri),e) = 0. 
For any fixed e, the number of rankings vr such that e = vri is simply (m — 1)! and so Xi{e) = 2{m — 1)!, 
as required. 

We now turn to the induction step. Assuming the induction hypothesis holds for A; — 1 > 1, we prove it 
for k > 1. This follows directly from Lemma [375] (and the induction hypothesis): 

Xkie) < Xfc„i(e) + (m-2)!-d (8) 
< 2{m-iy. + {k-2)-{m-2y.-d+{m-2)l-d (9) 
= 2(m - 1)! + (A; - 1) • (m - 2)! • d , (10) 

and the lemma is established. ■ 

Proof of Lemma IXSt Throughout the proof we fix k and e. For a ranking vr, let vr' be defined as follows: 

T^'k+i = ^k, K = and tt'j = nj for every j ^ {k,k + 1}. 

Observation 3.6 If tt and vr' are as defined above, then for each edge e where 7r(e) < k (and therefore, 
7r'(e) < k): MO^(e) = MO'^'(e). 

Observation l3.6l is true due to the fact that if 7r(e) < k then by the definition of vr', we have that vr'(e) = vr(e). 
Since in a recursive call we only go to an edge with a lower rank (see Observation 13.21 ). we get that the 
execution of MO'^(e) is equivalent to the execution of MO'^ (e). 

We shall use the notation 11^ for those rankings vr in which vr^^^ and vr^+i share a common endpoint. Note 
that if vr € 11^, then vr' G li^ as well (and if vr ^ 11^, then vr' ^ Ilfc). For two edges e = {vi,V2) and 
e' = (^21^3) (which share a common endpoint V2), we let Vc{e,e') = Vc{e' ,e) = V2 ('c' for 'common') 
and Vd{e, e') = f 1, Vd{e', e) = ('d' for 'different'). If e and e' are parallel edges, then we let Vd{e, e') = 
Vd{e', e) be Va{e) = Va{e') and fc(e, e') = Vc{e\ e) be Vh{e) = t'b(e'). If e is a self-loop on a vertex vi that 
is also an endpoint of e' (so that V2 = f 1), then fd(e, e') = Vc{e, e') = vi. 
For any edge e and for 1 < /c < m — 1, 

Xk+i{e) - Xk{e) 

= J](x-(t;,(vrfc+i),e)+X^(7;b(vrfc+i),e)) -Y,[x^ {va{TTk),e) + X^ {vb{T^k),e)) (11) 

TT TT 

= (^"K(vrfc+i),e)+X-(t;fe(vrfc+i),e)) - J] (x^ {va{^u),e) + X^ {vb{^k),e)) 

+ (x^{va{Trk+i).e) + X^{vb{TTk+i).e)) - Y {x^Va{7Tk),e)+X^{vb{7Tk),e)yi2) 

= Y X'"{vc{TTk+i,T^k),e) - Y ^''(■yc(vrfc,7rfc+i),e) (13) 
+ ^ X^(vrf(vrfc+i,vrfc),e) - ^ X^(vd(^fc,vrfe+i),e) (14) 
+ Y {x''{va{Trk+i),e) + X^Vb{Trk+i),e)^ - ^ (^"(t^a(vrfc), e) + X-(z;b(vrfc), e)) .(15) 
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By the definition of Vc{-, •), for every vr G Ilfc we liave tliat fc(vrfc+i, Tr^) = Vc{TTk,T^k+i) and so 

X''{vc{TTk+i,Trk),e) = X'^{vciTrk,TTk+i),e) , (16) 

implying that the expression in Equation (fT3] ) evaluates to 0. Since it' £ Ilk if and only if vr € Ilfc, we get 
that 

^ X''{vd{TTk+i,TTk),e) = X'''{vd{TTk+i,TT'k),e) = ^ X""' {vd{7T'k+i, tt',,) , e) , (17) 

TTGHfe TT'eHfe TTGHfe 

and 

(^X^Va{7rk+i),e)+X^vt,{7rk+i),e)]= J] (x-'(7;,(vr^+J, e) + X-'(^fe(7r;,+i), e)) 

= (x-'(t;,(4+i),e) + X-'(t;,«+i),e)) . (18) 

Therefore, 

Xfc+i(e) - Xfe(e) = ^ X'''(i)rf(7rfc+i,7rfc),e) - ^ X''(?;d(7rfc,7rfc+i),e) 

+ J] (x-'(t;„(7r[,+i),e)+X-'(^bK+i),e)) 
- (X"(7;a(7rfc),e)+X"(^;fe(7rfc),e)) . (19) 

The next useful observation is that for every vr ^ 11^ (and for every e and j € {a, b}), 

X^'{vj{7ri^,),e)=X^Vj{7rk),e) . (20) 

This follows by combining the fact that fj(vr[,_|_^) = fj(vrfc) with Observations 13.21 and 13.61 
By combining Equation ( fT9l ) with Equation (|20l ) we obtain that 

Xfc+i(e) -Xfc(e) = ^ X-'(i;rf«,+i,7r;:),e) - ^ vrfc+i), e) . (21) 

Tren^ weUk 

Therefore, we need only consider executions in which the underlying rankings vr and vr' belong to 11^, 
and the execution starts from the vertex vi{7t') = 'i'd(7r[,^^, vr^) = Vd(vrfc, tt^+i). We shall use the shorthand 
notation 't;2(7r') = -yc(7r^_^i, vr^) = Vc{TTk,-n:k+i), and 1)3 (vr') = ^^(vr^, 7r[„+i) = ^^(vrfc+i, vr^). For an 
illustration, see Figure [T] We shall make use of the following simple observation. 

Observation 3.7 Let e be a self-loop. For any vertex v and ranking vr, if in the course of the execution of 
\0'^{v) a call is made to MO''(e), then MO''(e) = TRUE. 

Observation 13.71 is true since if a call is made to MO'^(e) where e is a self-loop, i.e., e = {v, v) for some 
vertex v, then from Observation 13 . 3 1 we know that all other edges incident to v with ranks smaller than 7r(e) 
return FALSE. Therefore, by the definition of MC^ we get that MO'^(e) = TRUE. 

We would like to understand when X"^' {vi{'K'),e) = 1 while X"^ {vi{'K'),e) = 0. We consider three 
possible cases (for an illustration see Figure ^ : 
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Figure 1 : An illustration for the various cases in which tt £ Ilfc (i.e., ttJ. and ir'^.j^i share at least one endpoint) and we 

need to compare the executions of V0'^(wi(7r')) and VO'^ (ui(7r')) (where ui(7r') = Wd(7''fe4_i, '''fe) = ^^dl^rfc, Tr^+i)). 
We refer to the different cases (a)-(f) in the analysis. 



1. e = (ui(7r'),W2(vr')) (so that 7r'(e) = A: + 1 and 7r(e) = k). In this case, if X'^' {vi{TT'),e) = 1, 
then X'^ {vi{TT'),e) = 1. To verify this, note that if X'^ {vi{-K'),e) = 1 then by Observation 13.31 
MO'^ (e') = FALSE for each edge e' where vi{-k') is one of its endpoints and vr'(e') < A; + 1. By 
applying Observation 13.61 we get that for each edge e' such that '/r(e') < A; we have that MO'^(e') = 
MC^ (e'). Therefore, for each edge e' such that 7r(e') < k and vi{tt') is one of its endpoints we have 
thatMO''(e') = MO'''(e') = FALSE. Hence X"" {vi('K'),e) = 1. 

We note that if vr^ is a self-loop (see cases (c) and (f) in Figure [B, then by Observation 13 .7 1 we have 
that M0'''(7r^) = TRUE. By the definition of VC^' this imphes that 7r^_^]^ = e will not be visited in 
the course of the execution of VC^ {vi{'k')), so that X'^' {vi{t^'), e) is necessarily 0. 

2. e = {v2{'^')^v^{tt')), (so that 7r(e) = A; + 1 and vr'(e) = A:). In this case it is possible (though not 
necessary) that X'" {vi{'K'),e) = 1 and X''{vi{'k'), e) = 0. 

3. e ^ {{vi{'k'),V2{'k')), (^2(71'), ?;3(7r'))}. In this case it is also possible (though not necessary) that 
X^'(T;i(7r'),e) = 1 and X^(i;i(7r'), e) = 0. 

Out of all cases illustrated in Figure [H this is possible only in cases (a) and (b). We next explain why 
it is not possible in all other cases. 

• Case (c). If VC^ {vi{'k')) visits ebefore it visits vr^,, then so does V0'^(t;i(7r')) (from Observa- 
tion l3.6l ). Otherwise, \0^' {vi (vr')) visits vr^ first, but since it is a self-loop, from Observation l3.7l 
we have that MC' (vr^J = TRUE. By the definition of VC' we get that X^' {vi (vr'), e) = 0. 

• Case (d). If VC^ (fi(7r')) visits e before it visits vr^,,).^, then so does V0'^(fi(7r')) (from Ob- 
servation [321). Otherwise, if V0'^'(t;i(7r')) visits vr^^^ and e in the same sequence of recur- 
sive calls without visiting tt^, then so does V0'^(ui(7r')). If there is no such sequence, then 

{vi{k')) will visit vr^^^ and vr^. Since 7r[. is a self-loop, from Observation l3.7l we have that 
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M0'^'(7r^) = TRUE, implying that M0'^'(7r^^^) = false. Therefore, the sequence of recur- 
sive calls that visits e in the execution of VO'^ {vi{tt')), starts from an edge incident to fi(7r') 
whose rank is greater than k + 1, and the same sequence of calls is made in the execution of 

VO^(vi(^'))- 

• Case (e). Since the edges are parallel, if there is a sequence of recursive calls that visits e in 
the execution of VO'^ {vi{'k')), then there is such a sequence in the execution of V0'^(t'i(7r')), 
where the only difference is that the first sequence includes vr^ while the second includes vr^ 
(which are parallel edges). 

• Case (f). If VO'^ {vi (vr')) visits e in a sequence of recursive calls that starts with an edge having 
rank smaller than k, then from Observation 13.61 so will V0'^(t;i(7r')). Otherwise, since vr^ is a 
self-loop, by Observation 13. 7[ if a call is made to MO'^ i'^'k)' ^^^^ returns TRUE, causing the 
execution of VC^ (fi(7r')) to terminate without visiting any additional edges (so that e cannot 
be visited in a sequence of recursive calls that starts with an edge having rank at least k). 



1. 



2. 



3. 




Figure 2: An illustration for the three possible (sub-)cases when tt' G 11^: 1. e = {vi{tt'),V2{tt')); 2. e = 
{v2{tt'),V3{tt'));3. e ^ {(ui(7r'), U2(7r')), (u2(7r'), W3(7r'))}. This illustration corresponds to Case (a) in Figure[T](i.e., 
no self-loops and no parallel edges). 

For a fixed edge e we shall use the following notation for the sets of rankings that con^espond to the last 
two cases described above. Specifically: 

• Let n*^'^ = n^'^ denote the set of all rankings vr' G Ilfc where e = (^2(71"'), ^3(7'"')) ^^'^ 
X"^' {vi{Tr'), e) = 1. (Here we shall make the worst case assumption that X'^{vi{tt'), e) = 0). 

• Let H""^ = n^*^ denote the set of all rankings vr' G 11^ where e ^ {(^1(71'), ?;2(vr')), {v2{'^'),V3{tt'))} 
andX'^'(t;i(7r'),e) = 1 while X''{vi{Tr'),e) =0. 

Thus, Xfc_|_i(e) — Xk{e) < + |n~"^|. In order to upper bound in*^'^! + |n~'^|, we consider another set 

of rankings: 
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• Let n*^''' = n^' denote the set of all rankings vr' G 11^ such that e = {v2{'n''),V3{TT')) and 
X^'{vi{Tr'),e)=0. 

By the definition of 11^'^ and U.'^'^, we have that 

|n^'^| + |n^'°| < (m-2)!-(i. (22) 

This is true since each ranking vr' G 11'^'^ U 11'^''^ is determined by first setting vr'(e) = k, then selecting 
another edge incident to the endpoint 1^2 (vr') of e (if e is a self-loop then V2{tt') = vi{tt')) and giving it rank 
k + l (where there are at most deg(t;2(vr')) — 1 < d — 1 such edges), and finally selecting one of the possible 
{in — 2)1 rankings for the remaining edges. We next prove that |n^^| < |n'^''^|, from which Lemma 1331 
follows. 

To this end we prove the next claim. 

Claim 3.8 There is an injection from n~"^ to n'^'". 

The proof of Claim [3^ is very similar to a proof of a corresponding claim in IIYYI09II . but due to our need to 
extend the proof to a graph with self-loops and parallel edges, and also due to several additional differences, 
we include it here for completeness. 

Proof: We start by making the following observations: 

Observation 3.9 /fvr' G n~"^ and we are in Case (a) as illustrated in Figure\l\ then in the course of the 
execution ofYO^ {vi{'k')) there is a consecutive sequence of recursive calls that includes vr^^^, vr^ and e 
at the end. That is, there is a sequence of recursive calls corresponding to a path of edges (e^, e£_i . . . ei) 
such that ei = vr^^-^, e^-i = vr^ and ei = e. 

To verify Observation 13.91 note that since vr' G n~"^ we know thatX'^ (t;i(7r'),e) = 1 and X'^(7;i(7r'), e) = 
0. The only difference between the execution of VO'^ (fi(7r')) and V0'^(fi(7r')) is that MO'^ Wk+i) '^^'^ 
call MO'^Vfc) but M0''(7r^+;^) = MO"" {-Kk) cannot call M0''(4) = M0''(7rfc+i). Thus, the only way 
that VO'' {vii-K')) and V0''(fi(7r')) will create different sequences of recursive calls is when VO'^ {vi{Ti')) 
calls M0'''(7r^^J and then M0'''(7r[,_^-^) calls M0'''(7r^). Furthermore, these two calls have to be one after 
the other, since by Observation 13.21 the ranks can only decrease in a sequence of recursive calls. 

Observation 3.10 vr' G n~"^ and we are in Case (b) as illustrated in Figure |7] then in the course of 
the execution ofYO^ (fi(7r')) there is a consecutive sequence of recursive calls that starts with vr^, and 
ends with e (so that, in particular, it does not include vr^^j^j. That is, there is a sequence of recursive calls 
corresponding to a path of edges {ee-i • • • ei) such that ee-i = vr^ and ei = e. 

To verify Observation 13. 101 note that since vr' G n~"^ we know that X'^' {vi (tt') , e) = 1 and X'^ (vi (vr') , e) = 
0. The execution of YO^ {vi{7r')) cannot visit e in the course of a sequence of recursive calls starting 
from an edge incident to vi{tt') where the edge has ranking smaller k. Otherwise, from Observation 13.61 
we would get that V0'^(fi(7r')) also visits e which contradicts the premise that vr' G n~"^. We also 
know that YO^ (fi(7r')) cannot visit n'f^^i. If it would have, then since it is a self-loop, from Observa- 
tion [3i71 M0'^'(7r^_^^) = TRUE, causing V0'^'(fi(7r')) to terminate without visiting e, which contradicts 
X^'{vi{7T'),e) = l. 
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We shall now prove Claim fTM Let vr^ be a ranking in n~"^ (so that 7r^(e) ^ {k,k + 1}). By the 
definition of 11^'^ and by Observations 13.91 and 13. 101 we have the following. In Case (a), the execution of 
VO'^ {vi{tt^)) induces a sequence of (recursive) calls to the maximal matching oracle, where this sequence 
corresponds to a path P = (e^, . . . , ei) such that ee = ttI_^i, ei-i = vr^, and ei = e. In Case (b), the 
execution of VO'^ (7;i(7r^)) induces a sequence of (recursive) calls to the maximal matching oracle, where 
this sequence corresponds to a path P' = (e^-i, . . . , ei) such that e£_i = tt\, and ei = e. Since in Case 
(b) P is also a path in the graph, we may refer to the path P in both cases (and take into account, if needed, 
that in Case (b) = 7r^,_^ is a self-loop and is not part of the sequence of recursive calls that reaches e). 
While we do not know the rankings of the edges e£_2, . . . ei, we know from Observation 13.21 that they are 
in monotonically decreasing order, and that they are all smaller than k. We also know that the path does 
not include any parallel edges. This is true since if et and ct-i are adjacent edges in the path P and they 
are parallel edges, then from Observation 13.1 II 7r^(e/_i ) < ir'{et). But since they are parallel, they have 
the same endpoints, therefore, by the definition of YO^ and of MC^ , the vertex/edge from which the call 
was made, would have called MC^ (ej-i). Furthermore, with the exception of 7r[,^^ in Case (b), 
the the only edge along the path P that might be a self-loop is e. Otherwise, from Observation 13.71 the 
self-loop will return true, and thus path P will not visit e. 

We can write P as P = (vr^^^j, • • • '''"^(i)) where a{i) = TT^{ei), so that a{£) = k + 1 and a{£ — I) = k. 
We next define a mapping ip between rankings, such that </3('/r^) = vr'^, where we shall show that vr^ € n*^'*^, 
and that 99 is one-to-one. The ranking vr'' is defined as follows by "rotating" the ranks of the edges on P (and 
leaving the ranks of all other edges as in vr^). Namely, 7r'^(e2) = A; + 1, 7r''(ei) = k, and vr'^(ej) = a{j — 2) 
for every 3 < j < £. For an illustration, see Tabled] We first verify that 99 is a projection from H""^ to W'^. 











es 


62 


e\= e 


Rank in vr^ 


a{i) = k + l 


a{e-l) = k 




a(3) 


a{2) 


a(l) 


Rank in vr'^ 


aii-2) 


a{i - 3) 




a{l) 


a{i) = k + l 


a{l-\) = k 



Table 1: Ranking of P = (e^, . . . , ei) in vr^ and in vr' 



Namely, we need to show that: 

, i.e., vr^^-|^ and vr^ share an endpoint V2{t^'^\ and e — (^^2('^'')) ''^sC'^'^))- 

• X''"(t;i(7r°), e) = (that is, the execution of V0''''(wi(7r°)) does not create a call to MO''°(e)). In 
other words, (the execution of) VC^ {v\(Tpy) does not visit e. 

The first item directly follows from the definition of ■k^ . We thus turn to the second item. Recall that by 
our notational convention, v\{t:^^ = fd(vr^_,_j^, vr^) = Vd{e2, ei) (i.e, it is the endpoint that 62 does not share 
with ei) so that it is the common endpoint of 62 and 63, i.e., Vc{e2, 63). Since 

7r0(e3) = a{l) < a{£) = k + l = ^\e2) , (23) 

the execution of V0'^'(?;i(7r*')) will visit 63 before visiting 62. Since Tr^{e) = k, during the execution of 
YO''"{vi{tt^)), the call to M0''''(e3) will not cause a recursive call to MO''''(e). 

Observe that in the execution of VC^ (t;i(7r^)), the call to MC^^es) creates a recursive call on 62 
(since 62 follows 63 on the path P). Therefore, it must be the case that MC^ (e')=FALSE for every e' that 
has a common endpoint with 63 and such that 7r^(e') < cr(2). By the definition of ip, all edges that are not 
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on the path P have the same ranks in vr" and in vr^. Therefore, all edges with rank lower than a{l) have 
the same rank in vr^ and in vr*^. It follows that for every e' that has a common endpoint with 63 and such 
that 7r^(e') < cr(2), MO''''(e') = FALSE. We can conclude that M0''"(e3) = TRUE and so VO'''' ('t;i(7r0)) 
returns TRUE without visiting ei = e, as required. 

It remains to show that tp is an injection from H""^ to 11'^''^. Assume, contrary to the claim, that 99 is not 
an injection. That is, there are two different rankings vr^ 7^ vr^ € 11^'^ where (p{iT^) = ^{t^"^)- Let P^ = 
(ej^ , e\^__^ . . . e\) and P^ = (e|^ , • • • ef ) be the paths that con^espond to the sequence of recursive calls 

to the maximal matching oracle, in the executions of VO'^^ {vi{tt^)) and VO^^ {vi{'k'^)) respectively, where 
e\ = e\ = e, 7r^(ejj = ■K'^{e\^) = k + 1 and vr^(ej^_-^) = TT'^{ej^_-^) = k (recall that if tt^ corresponds to 
Case (b), then ej^ is a self-loop and is not actually part of the sequence of recursive calls that reaches e, and an 
analogous statement holds for vr^). Let s be the largest index such that {el, el_i • • • e}) = (e^, el_i . . . ef ). 
We denote this common subsequence by (e^, Cs-i ■ ■ ■ ei). Observe that s > 2. This is true since: (1) By the 
definitions of the paths, e\ = e\ = e, and (2) given that ^^(vr^) = </?(7r^) = vr^ and 'K^{e\) = 'K^{e^) = k + 1, 
it holds that 62 = e^. 

By the definitions of tp and s we have that 7r^(ej) = '/r^(ej) for each i € [s — 2]. Thus, o"i(z) = (T2(i) 
for each i € [s — 2], where we shall sometimes use the shorthand a{i) for this common value. For an 
illustration, see Table |2l 





Rank from y'(vr^) 


Rank from (/?(vr^) 




ai{li -l) = k 


CT2{i2 -l) = k 




(Ji{h) = k + l 


a2{l2) = k + l 




^i(l) 










7r"(e,_i) 


o"i(s - 3) 


<T2(S - 3) 




cji(s-2) 


<T2(S - 2) 



Table 2: Ranks of edges e\ = e\. . . e]._2 = el_2 are equal in and tt^ 

The next observation will be useful. 

Observation 3.11 For every edge e', if Tr^{e') < min{(7i(s — 1),(T2(s — 1)} or 7r^(e') < mm{ai{s — 
l),f72(s- 1)}, themr^e') = TT^{e'). Therefore, MO'''(e') = MO''\e') for e' such that n^e') = TT^{e') < 
min{(Ti(s - l),(T2(s - 1)}. 

We consider two cases: 

1. is a suffix of P^ or P^ is a suffix of P^. Without loss of generality, assume that P^ is a suffix of 
P\ so that s = £2- 

2. Otherwise (neither path is a suffix of the other), assume without loss of generality that ai{s — 1) < 
^2(5-1). 

In both cases, since el_^_i is not on P^, tp, when applied to vr^ does not change the ranking of ej^^. That is, 
'^^{el_^^l) = T^^iel^i). Since (by the definition of ip) 7r''(e]^;^) = ai{s — 1), we get that 

7r\el+,) = ai{s - I) = 7rHel_,) . (24) 
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In the first case (where is a suffix of P^), we have that cj2(s — 1) = k, while ai{s — I) < k, and so 

ai(s-l) <a2(s-l) (=7r2(e2_J) . (25) 
In the second case, this inequality was made as an explicit assumption. 




Figure 3: An illustration for the proof of Claim l3.8l 

We thus have that the execution of MO'^^(e^) visits e]^-^ before visiting e^^i. We would like to under- 
stand what occurs in the call to MO'^^ (^s+i)- If we are in Case (b) and P^ = (tt^,,.^, vr^., e), i.e., s = k, then, 

since el_^_i = ttI^^ is a self-loop, from Observation [3/71 MO''^(e]^i) = TRUE. Hence MO'^^ {e^ = e]) 
returns FALSE without visiting e^^i = e, but this stands in contradiction to the definition of P^. If we are in 
Case (a), then since the path P^ corresponds to a sequence of recursive calls to the maximal-matching oracle, 
we have that for every edge e' that shares an end-point with el^i and such that 7r^(e') < ai{s) = 7r^{el), 
the call to MO'^ (e') returns FALSE. Combining this with Observation 13.1 11 we get that for every edge 
e' that shares an end-point with el_^_i and such that 7r^(e') < o"i(s), the call to MO'^ {e') returns FALSE. 
By Equation (|24l) we get that MO'^'^ {el_^_i) returns TRUE. Hence, MO'^^(e^ = e^) returns FALSE without 
visiting e^_i, but this stands in contradiction to the definition of P^. ■ (Claim IXSl ) 
Having established Claim [X8l the proof of Lemma [33] is completed. ■ 

We are now ready to prove Theorem l3.1l 
Proof of Theorem |3.1t Recall that d denotes the maximum degree, d denotes the average degree and p 
denotes the ratio between the maximum degree and the minimum degree, which is denoted by dmin (where 
the latter is at least 1 since we assumed without loss of generality that there are no isolated vertices). By 
combining Equations (ID and (01) and applying Lemma 13.41 (as well as recalling that we counted each self- 
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loop as contributing 2 to the degree of a vertex), we get that: 




111 ( , 

< — - • — • — ; ■ m ■ [ m ■ 2[m — 1)1 

ml n 2drain \ 

= o(!^.^) =0(p.d). 



m ■ m — 1 



• (m - 2)! • d 



(26) 



(27) 



2 



(28) 



and we obtain the bound claimed. ■ 

4 Limiting the Exploration of Neighbor Sets 

The analysis in the previous section suffices to show an algorithm whose query complexity and running time 
are a factor of d larger than the number of oracle calls that it makes. The factor of d in this expression is 
due to querying all edges that are incident to the endpoints of each edge for which a call to the maximal 
matching oracle is made (where, as we explain momentarily, a random ranking can be selected in an online 
fashion). 

This section is devoted to a method for selecting incident edges of low rank efficiently without querying 
entire neighborhoods of relevant vertices. By applying the method, we reduce the query complexity and the 
running time by a factor of almost d. The main challenges here are to ensure that the ranks of encountered 
edges indeed come from the uniform distribution over all permutations and that the same decision with 
respect to a rank of an edge is made at both endpoints of the edges. 

Replacing a random ranking witli random numbers. The oracle construction described as Algorithm [T] 
and Algorithm |2] uses a random ranking ir : E ^ [m] of edges. We start by replacing a random ranking 
of edges with random real numbers in (0, 1] selected uniformly and independently for every edge e ^ E, 
yielding a vector a : E ^ [0,1] which we use in the same way as the ranking vr. Since the probability that 
two edges are assigned the same real number is 0, whenever the oracle compares the ranks of two edges e 
and e', it can check whether cj(e) < cr{e'), instead of whether 7r(e) < vr(e'), effectively yielding a random 
ranking of edges. Since each a{e) is independent, this small conceptual shift allows one to generate a{e) at 
random in an easier manner and to simplify the analysis. Though it is not possible to generate and store real 
numbers in (0, 1], we later introduce a proper discretization. 

4.1 A Data Structure for Accessing Neighbors 

The oracle described as Algorithms [T] and |2] always collects all edges around the vertex or edge being 
considered and sorts them to explore them recursively in increasing order of their random numbers. In this 
section we introduce a data structure that is responsible for generating the random numbers and providing 
edges for further exploration in the desired order. 

For every vertex u € we have a copy neighbors[w] of the data structure. (In fact, a copy for a 
given vertex is created when it is accessed for the very first time.) From the point of view of the exploration 
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algorithm, the data structure exposes only one operation: lowest(/^), where A; is a positive integer. The 
operation neighbors[7;].lowest(/c) lists edges incident to v in order of the random numbers assigned to 
them, omitting all appearances of parallel edges or self-loops except the first one, which has been assigned 
the lowest number. For each positive k, the operation returns a pair {w, r), where w is a vertex and r is a 
number in (0, 1] U {oo}. If r 7^ oo, then {v, w) is the edge with the k^^ lowest number in the above order, 
and r is the number assigned to it. Otherwise, the list is shorter than k and r = 00 indicates the query 
concerned a non-existing edge. We present the implementation of the data structure in Section l44l 

We rewrite Algorithms [T]and|2]to use the data structure, and present them as the oracle YO^{v) in 
Algorithm |3] and the oracle MO'^(e) in Algorithm HI respectively. 

Algorithm 3: An oracle VO"^ (v) for a vertex cover based on the input from the data structures neighbors, 
which assigns edges e random numbers cr(e) (online). Given a vertex v, the oracle returns TRUE if v belongs 
to the corresponding vertex cover and it returns FALSE otherwise. 

1 i:=l 

2 (tf, r) := neighbors['L'].lowest(z) 

3 while r / 00 do 
if MO''{{v,w)) = TRUE then 

|_ return TRUE 

6 i := i + 1 

7 (tf, r) := neighbors[«].lowest(i) 

8 return FALSE 



Algorithm 4: An oracle MO'^((tt, v)) for a maximal matching based on the input from the data structures 
neighbors, which assigns edges e random numbers cr(e) (online). Given an edge {u, v), the oracle returns 
TRUE if (it, v) belongs to the corresponding matching and it returns FALSE, otherwise. 

1 if MO'^{(u, v)) has already been computed then 

2 |_ return the computed answer 

3 /ci := 1 and k2 := 1 

4 {wi,ri) := neighbors[u].lowest(A;i) 

5 {w2,r2) := neighbors[v].lowest(fc2) 

6 while wi ^ V or W2 ^ u do 



7 


if ri < r2 then 


8 




if MO'^{{u, wi)) = TRUE then return false 


9 




ki -.^ki + l 


10 




{wi,ri) :=neighbors[u].lowest(A;i) 


11 


else 


12 




if MO°"((v, W2)) = TRUE then return false 


13 




k2 := /C2 + 1 


14 




{w2,r2) := neighbors[v].lowest(A;2) 



15 return TRUE 
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Claim 4.1 Let a be an injection from E to (0, 1]. Let vr : — > ^E\] be the corresponding ranking defined 
in such a way that for every edge e, cj(e) is the 7r(e)*'^ lowest number in the set {(T(e') : e' € E}. 

For every vertex v, the answer returned by VO'^(u) (Algorithm\3} is the same as the answer returned 
by VO'^{v) (AlgorithmU} provided the operation lowest works as specified and gives answers consistent 
with G. 

Proof: It is easy to verify that the claim holds when there are no parallel 

edges. This is true because when there are no pai^allel edges, a sequence of calls to 
neighbors[T;].lowest(l), neighbor s[f].lowest(A;) simply returns the first k edges inci- 
dent to V in order of increasing rank. Furthermore, when called on an edge (n, v). Algorithm ineffectively 
merges the two con^esponding lists of adjacent edges (i.e., those incident to u and those incident to v) to 
obtain a single list sorted according to rank, and makes recursive calls in the order dictated by the list. 

It remains to verify that the same is true when there are parallel edges. For a fixed choice of g and 
the induced ranking vr consider the two trees of recursive calls when calling VO'^(f) (Algorithm [Hi and 
NO"{y) (Algorithm |3]l, where the former calls the oracle M'^ (Algorithm |2ll, and the latter calls the oracle 
M'^ (Algorithm 131). When we refer to an edge in in the tree we actually mean an occurence of an edge in G 
on a path of recursive calls. 

These trees are both rooted at v, and with each edge there is an associated rank (number) and an asso- 
ciated answer computed by the corresponding maximal matching oracle. Recall that each path of recursive 
calls from the root to a leaf passes through edges with decreasing ranks (numbers). Furthermore, in both 
trees, if an edge (n, v) in the tree is associated with the answer FALSE, then there must be an edge (n, vS) (or 
(v, w)) adjacent to it in the tree with lower rank (a "child" of this edge) that is associated with the answer 
TRUE, and it is the highest ranking child that (u, v) has. If [u, v) is associated with the answer TRUE, then 
all the children of {u, v) in the tree are associated with the answer FALSE. It will actually be convenient to 
consider the full recursion trees without the "memoization" rule that we employ (which says that once an 
answer is determined for an edge it is not computed again). This in particular implies that for each edge that 
is the last edge on a path of recursive calls, the answer associated with it must be TRUE. 

By the definition of YO^{v) and the operation lowest, the tree corresponding to VO'^(f) contains 
only edges that have minimal ranking among each set of parallel edges that connect a pair of vertices. We 
claim that this tree is a "pruned" version of the tree that corresponds to YO'^{v), in the sense that all subtrees 
containing non-minimally ranked parallel edges are removed, and the answers associated with the remaining 
edges (and hence with the root v) are exactly the same. 

Let T'^{v) denote the tree of recursive calls for \0'^{v), and let h be the height of T^{v). Starting 
from i = h and going up the tree, we show that we can remove all non-minimally ranked parallel edges in 
level £ of T^{v) without altering the answer for their parent edges. For £ = h, we claim that there are no 
non-minimally ranked parallel edges in the last level of T'"{v), so that no pruning needs to be performed. 
To verify this, assume in contradiction that e is a non-minimally ranked parallel edge between vertices u 
and w where e is at the end of a recursive path of length h in T'^(f). Since e is not minimally ranked, there 
should be a "sibling" of e in the tree which correspond to the minimally ranked edge e' between u and w. 
Since vr(e') < vr(e), it must be the case that the answer associated with e', that is, M'^(e'), is FALSE. But e' 
also belongs to level h, so that e' is the last edge on a path of recursive calls, and hence cannot be answered 
FALSE. 

Assuming we have performed the pruning successfully for all levels i < £' < h, we show that we can 
perform it for level £. Consider a non-minimally ranked parallel edge e between vertices u and v in level i 
of T'^{v). As argued above, there is a "sibling" of e in the tree which correspond to the minimally ranked 
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edge e' between u and w. Since vr(e') < 7r(e), it must be the case that the answer associated with e', that 
is, M'^(e'), is FALSE. This implies that e' has a child e" in the tree resulting from pruning all non-minimal 
parallel edges from levels £' > £, such that M'^{e") = TRUE. But since vr(e") < 7r(e') < 7r(e), and e" is 
also adjacent to e, we get that M'^(e) is FALSE as well. Hence, it is possible to prune e from the tree without 
altering the answer obtained for its parent. ■ 

4.2 Implementing lowest: The High-Level Idea 

The pseudo-code for the procedure lowest as well as the data structure that it uses, ai^e given in full detail 
in Subsection 14.41 Here we give a high-level description. For the sake of simplicity of the presentation, in 
this description we assume that there are no parallel edges. 

Roughly speaking, the procedure lowest for a vertex v is implemented in "batches". Namely, con- 
sidering intervals of (0,1] of the form (2^*, 2^*+^] (for i € [d^], where = [logd], as well as the 
interval (0, 2""^*]), the procedure does the following. It first decides which edges incident to v should be 
assigned a value in the cuiTcnt interval (2~*, 2"*+^]. In this stage each edge is identified with its label (in 
{1, . . . , deg{v)}). The procedure then determines the identity of the other endpoint of each of these edges 
by performing a neighbor query, and it assigns the edge a value a{{v,w)), selected uniformly at random 
from the interval. This assignment is performed unless a certain constraint is discovered due to information 
held in neighbor s[7i;], as we explain subsequently. Once a{{v, w)) is determined, the other endpoint of 
the edge, w, is "notified". That is, the data structure neighbors[i(;] is updated with this new information. 
The procedure "opens" a new interval (2~*+^, 2~'+^] if the index k it is called with is such that the number 
of neighbors wofv whose identity has been revealed and such that (t{{v, w)) < 2~*+^ is strictly less than k. 
Thus, the procedure performs queries and assigns valued to edges "on demand", but it does so for "batches" 
of edges. More precise details follow. 

The data structure neighbors maintains two values for each vertex v: lb, and next_lb (where the 
latter is always twice the former). When a vertex is first encountered, lb is set to and next_lb is set 
to 2~'^*. Second, the data structure maintains a dictionary assigned_number, which holds, for those 
vertices w that are known to be neighbors of v, the value w)) that was assigned to the edge between 
them (initially, the dictionary is empty). The subset of indices in {1, ... , deg(w)} that correspond to edges 
for which the other endpoint has not been revealed (and do not yet have an associated value), are considered 
unassigned. Third, the data structure maintains a list of pairs {w, r), where u; is a (known) neighbor of v 
and r = cr((t;, w)). This list is sorted in ascending order of r's, and it contains exactly those w for which 
the coiTcsponding r is at most lb. 

If a call is made to neighbor s[w].lowest(fc) with A; > deg(u) then it return^ (w, oo). Otherwise, the 
procedure does the following until the length of sorted is at least k. It first considers those edges {v, w) 
incident to v that were already assigned a value r and this value belongs to the interval (lb, next_lb] (that 
is, as signed_number[?i;] G (lb, next_lb]). The setting of the value r for each such edge {v,w) was 
performed previously in the course of call to neighbors[t(;].lowest(-). Let the con^esponding subset of 
pairs {w, r) be denoted S. 

The procedure next selects a subset T of {1, . . . ,deg(u)} containing the labels of those (additional) 
edges that it will (tentatively) assign a value in (lb,next_lb]. Putting aside for now the issue of time- 
efficiency (which we return to later), this can be done by flipping a coin with bias "^^f'j^^"*"^ indepen- 

^Recall that we assume that there are no parallel edges, or else (v, oo) is returned if k exceeds the "effective" degree of v, that 
is, counting parallel edges as a single edge. 
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dently for each edge label in the subset of unassigned edge labels. For each t £ T, the procedure now 
performs a neighbor query to obtain the t^^ neighbor of v. Denoting this neighbor by w, let lb' denote 
the lower bound lb held by w, that is, in the data structure neighbors[it;]. If lb' < lb, so that the 
lower bound constraint imposed by w is no larger than that imposed by v, then the following operations are 
performed. 

First, a random number r in the interval (lb,next_lb] is selected uniformly at random, and 
assigned_number[i(;] is set to r. In addition, assigned_number[ii] is set to r in the data struc- 
ture neighbors[i(;] (so that w is "notified" of the revealed edge {v,w) as well as the assignment 
r = cf{{v, w))). Finally, the pair {w, r) is added to S. 

If lb' > lb, which means that lb' > next_lb (given the way the intervals are defined), then the lower 
bound constraint imposed by the end point w of the edge {v, w) does not allow the edge to be assigned a 
value in the interval (lb,next_lb], and so effectively its selection to T is retracted. Note that since the 
decision whether an edge label is added to T is done independently for the different edges, the end effect (of 
not assigning {v, w) a value in (lb,next_lb]) is exactly the same as the one we would get if we had the 
knowledge in advance (before selecting T), that the con^esponding edge label t should not be selected. 

After going over all labels t in T, the resulting set S of pairs {w, r) is sorted in ascending order of r's, 
and it is appended to the end of the list sorted. Thus, sorted now includes all pairs {w, r) such that w is 
a neighbor of v, the value assigned to this edge is r, and r < next_lb. The variables lb and next_lb are 
then updated so that lb is set to next_lb and next_lb is set to 2 • next_lb. Once the length of sorted 
is at least k, the procedure returns sorted[/c]. In Subsection l4.4l we formally establish that the distribution 
of random numbers the data structures neighbors[t;] provide access to via the operation lowest(A;) is 
the same as assigning independently at random a number from the range (0, 1] to each edge. 

4.3 Generating Random Numbers 

In this subsection we describe a random process that generates random numbers a{e) for edges e ^ E. The 
procedure lowest applies this process in the course of its executions. In the remainder of this section, \Z\ 
denotes the length of an arbitrary real interval I. We do not distinguish open and closed intervals here. For 
instance, |(0,1)| = j[0,l]| = |(0,1]| = |[0,1)| = 1. 

Let d be an upper bound on the maximum vertex degree. We set d^, = [log d] . For every edge e, the 
number cr(e) should be selected independently, uniformly at random from the range (0, 1]. We partition this 
range into + 1 intervals. We set 




{2-\2"'+^] forieK], 
(0,2-^^*] fori = 4 + 1. 



Algorithm 5: A Process for Selecting a Random Number Assigned to an Edge 
1 for i ^ + 1 downto 2 do 

\X- 1 

with probability — \t \ • return a number selected from Ij uniformly at random (and 

^ l^l<j<i 

terminate) 

3 return a number selected from Zi uniformly at random 

We describe our process as Algorithm [5] The process first selects one of the intervals Xj, and then selects 
a number uniformly at random from this interval. The selection of the interval is conducted as follows. We 
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consider the intervals in reverse order, from to Zi. For a considered interval, we decide that the 

number belongs to this interval with probability equal to the length of the interval over the sum of lengths of 
all the remaining intervals. The process selects each interval with probability proportional to its length, and 
since the lengths of all intervals sum up to 1 , the number that the process returns is uniformly distributed on 
the entire interval (0, 1]. 

Corollary 4.2 Algorithm\5\selects a random number from the uniform distribution on (0, 1]. 

Note that by simulating a few iterations of the loop in the above process, one can decide that the number 
assigned to a given edge is either a specific number less than or equal to 2~*, or that it is greater than 2~* 
without specifying it further, for some i. Later, whenever more information about the number is needed, 
we may continue with consecutive iterations of the loop. As we see later, we use the process in our data 
structures neighbor s[t;] to lower the query and time complexity of the resulting vertex cover algorithm. 

4.4 Data Structures 

We now describe the data structures neighbors[w]. Each data structure neighbors[w] simulates the 
random process described in Section 14.31 for all edges incident to v in the course of the executions of 
neighbors[i;]. lowest. The data structure simultaneously makes a single iteration of the loop in Al- 
gorithm [5] for all incident edges. It may be the case that for some edge {v,w), the random number has 
already been specified. In this case, the result of the iteration for this {v, w) is discarded. It may also be 
the case that this iteration of the loop has already been taken care of by neighbor s[ii;], the data structure 
for the other endpoint of the edge. The data structures communicate to make sure that a second execution 
of a given iteration does not overrule the first. The data structures are designed to minimize the amount of 
necessary communication. Note that if a data structure does not have to communicate with a data structure 
at the other endpoint of a given neighbor, it does not even have to know the neighbor it is connected to with 
a given edge, which can be used to save a single query. By using this approach, we eventually save a factor 
of nearly d in the query complexity. 

Each data structure neighbors[t;] supports the following operations: 

neighbors[t;].lowest(A;): As already mentioned, this is the only operation that is directly used by the 
oracles (Algorithm [3] and Algorithm |4l). It returns a pair iw, r), where (u, w) is the edge with the A;*^^ 
lowest random number assigned to edges incident to v, omitting a second and furher appearances for 
parallel edges, and r is the random value assigned to {v,w). If r = oo, then k is greater than the 
length of such a defined list. 

neighbors[t;].lower±>ound(): The operation returns the current lower bound the data structure im- 
poses on the edges that are incident to v and have not been assigned a specific random number yet. 
The set of possible values returned by the procedure is {0} U {2* : — < i < 0}. Let 1^ be the num- 
ber returned by the operation. It implies that the data structure simultaneously simulated the random 
process described in Section 14.31 for incident edges until it made sure that the random numbers that 
have not been fixed belong to {l^ , 1] . 

Furthermore, let {v, w) be an edge in the graph. Let £y and £^ be the numbers returned by the 
operation for neighbors[f] and neighbors[t(;], respectively. If no specific random number has 
been assigned to {v,w), then we know that the random number will eventually be selected uniformly 
at random from {m.ax{£y , i^} , 1]. 
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neighbor s[t;].set_value(zi;, r): It is used to notify the data structure neighbors[v] that the ran- 
dom value assigned to (f , w) has been set to r. This operation is used when the data structure 
neighbors[it;] assigns a specific random number to {v,w). Before assigning r, the data struc- 
ture neighbors[tt;] has to make sure that r > neighbors[t'].lower±>ound(), i.e., it has not 
been decided by the data structure neighbors[f] that the random number assigned to v is greater 
than r. 

To implement the above operations, each data structure neighbors[u] maintains the following infor- 
mation: 

lb: The variable specifies the lower bound on the incident edges that were not assigned a random number 
yet. This is the value returned by the operation neighbors[i']. lower jDound(). This is also the 
value at which the simulation of the process generating random number for edges incident to v has 
stopped. 

next_lb: If specific random numbers assigned to more edges are necessary, the next considered range of 
random numbers will be (lb, next_lb], and next_lb will become the new lower bound for the 
edges that have not been assigned any random number. This variable is redundant, because its value 
is implied by the value of lb, but using it simplifies the pseudocode. 

assigned_number: This is a dictionary that maps neighbors of t; to numbers in (0, 1]. Initially, the dic- 
tionary is empty. If as signed_number [id] = NONE, i.e., there is no mapping for w, then no specific 
random number has been assigned to any of the edges {v, w) yet. Otherwise, assigned_number[iy] 
is the lowest random number that has been assigned to any parallel edge {v,w). 

sorted: This is a list consisting of pairs [w, r), where lu is a neighbor of v and r is the number assigned 
to the edge {v,w). It is sorted in ascending order of r's, and it contains exactly those w for which 
the edge {v, w) (with the lowest assigned random number) has an assigned random number less than 
or equal to lb. For all neighbors w that do not appear on the list, the lowest number assigned to any 
edge {v, w) is greater than lb. 

We give pseudocode for all data structure operations as Algorithms |6l El [H and|9] We postpone all issues 
related to an efficient implementation of the data structure to Section 1431 Three of them are straightforward, 
and we only elaborate on the operation neighbors[t;].lowest(A;) (see Algorithm |9ll. 



Algorithm 6: The procedure for initializing neighbors[v] 

1 lb := 

2 next.lb := 2^^^* 

3 assigned_number := {empty map} 

4 sorted := {empty Ust} 



Algorithm 7: The procedure neighbors[t;].set_value('u;, r) 
1 assigned_number[t(;] := r 
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Algorithms: The procedure neighbors[i']. lower jDound() 
1 return lb 



As long as not sufficiently many lowest random numbers assigned to edges incident to v have been 
determined, the operation lowest simulates the next iteration of the loop in the random process that we 
use for generating random numbers. Let / be the interval (lb, next_lb]. The operation wants to determine 
all random numbers assigned to edges incident to v that lay in /. First, in Line 2, it determines the random 
numbers in / that have already been assigned by the other endpoints of corresponding edges. In Line 3, the 
operation simulates an iteration of the loop of the random process for all edges incident to v to determine 
a subset of them that will have numbers in / (unless it has already been decided for a given edge that its 
random number is not in /). In the loop in Line 4, the operation considers each of these edges. Let {v, w) 
be one of them, where w is its other endpoint, queried by the operation. In Line 6, the operation generates 
a prospective random number r G / for the edge. First, the operation makes sure that this iteration of the 
has not been simulated by the other endpoint (the condition in Step 7). If this is the case, the operation 
considers two further cases. If r is lower than the lowest number assigned to any parallel edge {v, w) so 
far, the procedure updates the appropriate data structures with this information (Steps 8-1 1). If no random 
number has ever been assigned to any edge {v, w), the procedure assigns it and updates the data structures 
appropriately (Step 12-15). When the operation finishes going over the list of all potentially selected edges 
and eventually determines all incident edges with new lowest random numbers, it sorts them in order of 
their random number and appends them in this order to the list sorted. Finally, when sufficiently many 
edges with lowest numbers have been determined, the operation returns the identity of the edge with the k^^^ 
smallest number. 

Lemma 4.3 The lists of incident edges that the data structures neighbors[t'] provide access to via the 
operation lowest(A;) are distributed in the same way as when each edge is assigned independently at 
random a number from the range (0,1]. 

Proof: We know from Corollary 14.21 that the random process generates a random number from the distribu- 
tion (0, 1]. Each data structure neighbors[w] simulates consecutive iterations of the loop in this process 
for all edges incident to v. Consider a group of parallel edges {v,w). For each of these edges, the ran- 
dom process is simulated by both neighbors[v] and neighbors[ii;]. We have to show that until the 
lowest number assigned to the edges in this group is determined (which happens when it is added to the 
list sorted), then for each edge the decision made in the first simulation matters. Why is this the case? 
Recall that the random process considers intervals Id*+i> 2^d*> . . . , Ii as the sources of the random number 
in this order. As long as both neighbors[w] and neighbors[t(;] reject a given interval their decisions 
are the same, so the first decision is in effect. Now suppose without loss of generality that neighbors [it;] 
simulates a consecutive iteration of the loop in the random process and decides to use Xi as the source of the 
random number for a given edge {v, w) in Step|9]of the operation lowest. If neighbor s[t;] has already 
simulated this iteration (the condition verified in Step 7), the operation does not proceed. Otherwise, the ran- 
dom number assigned to the edge is considered for a new minimum random number assigned to this group 
of parallel edges. Note that since the operation keeps simulating iterations even after a random number is 
assigned, it could be the case for a specific copy of {v, w) that a new, higher random number is considered, 
but it is ignored, because it is higher than the first decision, which is the only one that has impact on the list 
that the operation lowest provides access to. 
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Algorithm 9: The procedure neighbors[t;].lowest(A;) 



1 while length( sorted) < k and lb < 1 do 

S := set of pairs {w, r) such that assigned_number[i(;] = r and r € (lb, next_lb] 
T := subset of {1, ... , deg(f )} with each number included independently 

with probability next_lb-lb 
foreach t € T do 



i-lb 



w 



t^^ neighbor of v 



r := a number selected uniformly at random from (lb, next_lb] 
if neighbors[it;].lowerJDound() < lb then 
if 3(i«, r') € S s.t. r < r' then 

as signed_number[?i;] := r 
neighbor s [if], set _value(w, r) 
replace {w, r') with ('u;, r) in S 

if assigned_number[t(;] = NONE then 
assigned_number[u;] := r 
neighbors [if], set _value(t', r) 
S := S^{{w,r)} 



Sort S in ascending order of their r, and append at the end of sorted 

lb := next_lb 
next_lb := 2 • next_lb 



19 if length(sorted) < A; then return {v,oo) 

20 else return sorted[fc] 



The correctness of the data structure follows from the fact that it extends the list sorted by always 
adding all edges with random numbers in a consecutive interval, and it always takes into consideration 
decisions already made by data structures for the other endpoints for these intervals. ■ 

4.5 Query Complexity 

We now show that the number of queries that the algorithm makes is not much higher than the number of 
recursive calls in the graph exploration procedures. The following simple lemma easily follows from the 
Chemoff bound and will help us analyze the behavior of the algorithm. 

Lemma 4.4 Let Xi, .. ., Xg be independent random Bernoulli variables such that each Xi equals 1 with 
probability p. It holds: 

• For any 6 G (0, 1/2), 

< 6 -111(1/(5) •max{l,ps}. 

i 

with probability at least 1 — 5. 
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For any 5 G (0, 1/2), if ps > 81n(l/5), then 



with probability at least 1 — 6. 



Proof: Let us first prove the first claim. If 6 • \n{l/5) ■ max{l, ps} > s, the claim follows trivially. 
Otherwise, there exist independent Bernoulli random variables Yi, I < i < s such that for each i, 

Pr[Yi = 1] = 3 • Hi/ 6) ■ max{l/ s,p} > p 

since from the definition of 5: 3 • \n{l/6) > 1. Therefore Pr[Xj = 1] < Pr[l^ = 1]. By this fact and by the 
Chernoff bound, 

Pr[^X,>2E[^Y^] < Pr[^Y,>2E[^Yi\] 

< exp(— ln(l/5) • max{l,ps}) 

< exp(-ln(l/5)) < 5. 

The second claim also directly follows from the Chernoff bound: 

Pr[^Xi < ps/2] < exp(-(l/2)2 .ps/2) < 6. 



Definition 4.5 Denote Ji = IJj=l^ where 1 < i < d^, + 1. For example: J\ = (0, 1] and Jdi,+i = 
(0, We expect that the number of incident edges to v with random numbers in Ji to be deg(t') • \Ji\. 

We now define a property of vertices that is useful in our analysis. Intuitively, we say that a vertex is "usual" 
if the numbers of incident edges with random numbers in specific subranges of (0, 1] ai"e close to their 
expectations. 

Definition 4.6 Let a > 0. We say that a vertex v is a-usual if the random numbers assigned to edges 
incident to v have the following properties for all i £ {1, . . . ,di, + 1}: 

• Upper bound: The number of incident edges with random numbers in Ji is 

at most max{a, a ■ deg(f ) • \ JiW- 

• Lower bound: If deg{v) • |j7i| > a, then the number of edges with random numbers in J is 

at least deg{v) ■ \Ji\/2. 

We now basically want to show that the relevant vertices are a-usual, and later on we will use it to prove 
a lower bound. 

We define an additional quantity that is useful later in bounding the total running time of the algorithm. 
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Definition 4.7 For an execution of Step^of Algorithm^where the number of neighbors is k andp G [0, 1] 
is the probability of selecting each of them, we say that the toll /or running it is kp. 

We now prove a bound on the query complexity of the algorithm and other quantities, which are useful 
later to bound the running time. We start by introducing the main Lemma (Lemma l48] ). followed by proving 
Lemma \4~9\ which will help us prove Lemma 

Lemma 4.8 Consider an algorithm A that queries the input graph only via the oracle described as Algo- 
rithm\J} Let t > 1 be the expected resulting number of calls in A to the oracles described as Algorithm\J\ 
and Algorithm^ Let d be an upper bound on the maximum degree of the input graph. 

Suppose now that we run this algorithm replacing calls to AlgorithmUlwith calls to Algorithm\3\ The 
following events hold all at the same time with probability 1 — 1 /20.' 

L The total number of calls to Algorithms\3\and^is 0{t) 

2. The operation lowest in data structures neighbors[T;] runs at most 0{t) times. 

3. The query complexity of A is 0(t ■ \o^{dt)). 

4. The total toll for running Step^of Algorithm^is 0{t ■ log{dt)). 

Before proving Lemma |4T8] we establish the following Lemma: 

Lemma 4.9 Assume the conditions of Lemma \4.8\ Let t' = WOt, 5 = l/(40000t(d + l)(d^ + 1)), and 
a = 8 ■ ln(l/5). The following three events happen with probability less than ^ for each: 

L The total number of calls to Algorithm\3\and Algorithm^is bounded by t'. 

2. The first 2t' vertices for which the operation lowest is called are a-usual. 

3. For the first 2t' vertices v for which the operation lowest is called, the size of the set T generated 
in the j^^ execution of Step^of the operation is bounded by a ■ max{l, deg{v) ■ 2^~'^*}. 

Proof: For every group of parallel edges, the operation lowest lists only the edge with the lowest number. 
For the purpose of this analysis we assume that the operation lists in fact all occurences of a given parallel 
edge. The final complexity is only reduced because of the fact that some unnecessary calls ai^e omitted. 

1. Let us bound the probability that one of the above events does not occur. By Markov's inequality the 
probability that the first event does not occur is bounded by 

2. We shall now prove that the first 2t' vertices for which the operation lowest is called are a-usual. 
The total number of vertices that have an incident edge for which the process generating random 
numbers is simulated in the above calls is bounded by 2t' • (d + 1). The property of being a-usual is 
a function of only random numbers assigned to incident edges. 

For J'i let X = J2j=i -^j where p = Pr[Xj = 1] = \ J'i\, s = deg{v), i.e. X is the number of all 
incident edges to v with random numbers in J'i. From Lemma |44] we get that: 

Pr[^Xi > a-max{l,|j;|deg(t;)}] = Pr[^Xi > 8 • ln(l/5) • max{l,ps}] 

i i 

< Pr[^Xi > 6-ln(l/(5) •max{l,_ps}] < (5 
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Also, from Lemma l4!4l we get that: 



i i 

i.e. V is not a-usual because of Ji with probability less than 25. From union bound on all all i G 
[di, + 1] we get that vertex v is not a-usual with probability less than 26 {d^, + 1). 
Using the union bound again, this time over the vertices incident to edges for which the random 
process is run, the probability that any of them is not a-usual is bounded by 

2t' • (d + 1) ■ 26id^ + 1) = A00tS{d + l){d^ + 1) = -1-. 

3. We need to prove that for the first 2t' vertices v for which the operation lowest is called, 
the size of the set T generated in the j^^ execution of Step |9] of the operation is bounded by 

a ■ max{l, deg(w) • 2^~'^*}. 

Let V be one of the first 2t' vertices for which the operation neighbors[w]. lowest is called. Ob- 
serve that in the j*'^ iteration of the loop while, (next_lb — lb)/(l — lb) is at most 2^"'^*. Therefore, 
it follows from Lemma 1441 that for each j € {1, . . . , + 1}, the size of the set T in Algorithm |9] 
selected in the j^^ execution of Step |9] is bounded by a ■ max{l, deg(u) • 2^~'^*} with probability 
1 — 5. By the union bound over all j and the first 2t' vertices, the probability that the third event does 
not occur is bounded by 

2t'(4 + l)S = 200t(d^ + 1) • l/(40000t(d + l){d^ + 1) < — 



Summarizing, the probability that at least one of the three events does not occur is bounded by 

3 1 
100 ^ 20 

Let us now prove Lemma [48] assuming that the events in Lemma |49l occur. 
Proof of Lemma l48t 

1. We need to prove that the total number of calls to Algorithms [3] and |4] is 0{t). This follows directly 
from Lemma [4!9l we proved it there for t' = 0{t). 

2. We need to show that the operation lowest in data structures neighbors[f] runs at most 0{t) 
times. 

The total number of vertices v for which the operation neighbors[w]. lowest is called is bounded 
by 2t' , because a call to one of the oracles (Algorithms [3] and requires calling the operation 
lowest for at most two vertices. It follows from the implementation of the oracles that the op- 
eration neighbors[t;]. lowest is executed at most ?>t' = 0{t) times if the number of oracle calls 
is bounded by t' (which was proved in Lemma 14.91 ). This is true because in Algorithm |3] we call 
neighbor s[f]. lowest once and in Algorithmic we call neighbor s[u]. lowest twice. 
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3. We will now show that the query complexity of ^ is 0(t • log^ {dt)). 

For each vertex v, denote ky € [0, deg(f )] the number of times we call neighbors[i;].lowest(fc) 
on V. We assume that if the operation is not executed for a given vertex, then ky = 0. It holds that: 

We now attempt to bound the query complexity necessary to execute the operation 
neighbors[?;]. lowest for a given v such that ky > 0. Note that the expected number of edges with 
random numbers in a given J'i is deg(f)/2*~^. Recall that from Lemma [49l we know that the first 
2t' vertices for which the operation lowest is called are a-usual. From the lower bound of a-usual 
(Definition 14.51 ) we get that If deg{v) • |j7i| > a, then the number of edges with random numbers in 



J'i is at least 
Therefore, if 



deg{v) ■ \J,\/2. 



deg{v) ■ \ Ji\ = deg(t;)/2*-^ > max{2a,2ky} 
then the number of edges with random numbers in J'i is at least 

max{2a,2A;^} , i ^ , 
= maxja, ky\ > 

i.e. if i is such that deg(w)/2*~^ is at least max{2a, 2kr^}, then at least ky edges incident to v have 
random numbers in Ji. This also holds for i such that deg(t;)/2*~^ > 2aky. Let i„ be the largest 
integer i such that 2* < ^^^^ (remember i = d^^+i, • • • )■ Since iy is the maximum i that satisfies 



this, then 



2i„+i > deg(^) ^ ^i^ ^ deg{v) ^ 2„^^ ^ 2aky 



aky 2aky deg{v) 

The body of the loop while in Algorithm |9] is executed at most + 2 — times for v (remember 
we start from i = d^+i), independently of how many times the operation is executed for v, because 
all relevant edges incident to v are discovered during these iterations. From Lemma 14.91 we know 
that the size of the set T in Algorithm |9] selected in the j^^ execution of this loop is bounded by 
a ■ max{l, deg(?;) • 2^~'^*}. Furthermore, the sum of sizes of all sets T generated for v is bounded by 

di,+2—iy 

\ ^ „, ™„,. ri J — ,('„.\ oi— d*l ^ „J J I i\ I o„, J — ,/'„.\ o2— i„ 



a ■ max{l, deg(?;) • 2^"'** } < a{d^ + 1) + 2a • deg{v) ■ 2' 



< a{d^ + 1) + Wa^ky. 



This also bounds the number of neighbor queries for v. Since these are the only neighbor queries in 
the algorithm, by summing over all v with ky > 0, the total number of neighbor queries is bounded 
by 

2t' ■ a{d^ + 1) + X] 16a^^i, < 200at{d^ + 1) + 16a^ • 300t = 0{at{d^ + a)) = 0{t ■ \og^[dt)). 

(Recall t' = 200t and that YlveV — Note that degree queries appeal" only in Step |9] of the 
operation neighbors[f]. lowest with one query to discover the size of the set from which a subset 
is selected. The number of degree queries is in this case bounded by the total number of executions of 
SteplH which is at most 0{t ■ log d). Summarizing, the total query complexity is 0{t ■ \og^{dt)). 
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4. Finally, we need to prove that the total toll for running Step|9]of Algorithm |9] is 0{t ■ \og{dt)). Recall 
that the toll is defined as kp where k is the number of neighbors and p is the probability to selecting 
each of them in an execution of Step|9]of Algorithm|9l Using arguments as above, the toll for running 
Step|9]in the operation neighbors[w]. lowest for a given v is bounded by 

y deg(w) • 2^-'^* < 2 • degiv) ■ 2^-'- < 8 • degfi;) • -^Vr = ^^oiK 
^ deg(w) 

By summing over all vertices v, we obtain a bound on the total toll: 

^ IQak^ < 4800at = 0{t ■ log{dt)). 



4.6 Efficient Implementation 

We have aheady introduced techniques that can be used to show an approximation algorithm whose query 
complexity has near-linear dependence on the maximum degree d. Unfortunately, a straightforward imple- 
mentation of the algorithm results in a running time with approximately quadratic dependence on d. The 
goal of this section is to remove a factor of approximately d from the running time of the algorithm. Our 
main problem is how to efficiently simulate Step |9] in the operation lowest. Note that Step |9] is sampling 
from a binomial distribution. 

First, in Lemma 14.1 1[ we prove that there is an algorithm that can simulate a binomial distribution which 
runs in efficient time. Finally, in Theorem I4.13[ we will show how to use it in our algorithms and how to 
bound the running time by 0(t • log^((it)). 
We start by defining the binomial distribution. 

Definition 4.10 We write B{k,p), where k is a positive integer and p € [0, 1], to denote the binomial 
distribution with success probability p o« {0, 1, . . . , k} distributed as Yl!i=i where each Xi, 1 < i < k, 
is an independent random variable that equals 1 with probability p, and with probability 1 — p. 

It is well known that the probability that a value drawn from the binomial distribution B{k,p) equals q 
is (^)p'^(l — pY^'i. We now show how to efficiently sample from this distribution. 

Lemma 4.11 Let a, h, k, and Q be positive integers, where a < b and Q > 1, that can be represented in 
the standard binary form, using a constant number of machine words. There is an algorithm that takes a, 
b, k, and Q as parameters, runs in 0(max{/i;a/&, 1} • logQ) time, and outputs an integer selected from a 
distribution V on {0, 1, . . . , A;} such that the total variation distance between V and B(k,a/b) is bounded 
by l/Q. 

Proof: If a = b, then the algorithm can return the trivial answer in 0(1) time, so we can safely assume for 
the rest of the proof that a < b. Let p = a/b and let qi = (^)p*(l — be the probability of drawing 
i from B{k,p). Let s = min{6 ■ ln{2Q) ■ max{l, ka/b}, k}. For each z < s, we compute a real number 
q[ € [0, 1] such that qi — q[ < l/2{k + l)Q and X^^^g I'i ~ ^ (details about how to compute those q'- are 
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given in Lemma 14.121) . Then we select the output of the algorithm from the distribution given by q['s. We 
write V to denote this distribution. 

Let us bound the total variation distance between this distribution and B{k,p). It suffices to show that 
for every subsets 5" of {0, ... , k}, the probability of selecting an integer from 5 in B{k, p) is not greater by 
more than l/Q, compared to the probability of selecting an integer in S from V. Consider an ai^bitrary such 
set 5*. Let be the subset of 5 consisting of numbers at most s. Let 52 be the subset of 5 consisting of 
integers greater than s. We have 

i&s i&Si ieSi ^ \ < / \i(zsi J ^ 
Recall that X/ = 1 with probability p. If s = k then 

k k 
Pr[J^X, >s] = [Y.Xi>k] =0 

i=l 1=1 

If s = 6 • ln(2(5) • max{l, ka/b} then we define 6 = and from Lemma |44] we have that 

k 



Pr[^ > 6-ln(i) •max{l,p/c}] <6 



i=l 

Hence, 



In other words: the probability that a number greater than s is being selected from B{k,p) (i.e. s Xj's are 
1) is bounded by Therefore, 



<2^ 



(30) 



From [29] and [30l we get: 




Therefore, 



which proves our Lemma. Next, in Lemma l4.12l we will also show that the running time of the algorithm is 
0(s) = 0(max{A;f,l}log(Q)). ■ 

We now describe how to compute values that are approximation to qi. 

Lemma 4.12 Recall: a < b, p = a/b and qi = — (probability of drawing ifrom B{k,p)). 

Let s = min{6 • ln{2Q) ■ max{l, ka/b}, k}. For each i < s, we can compute a real number q'- G [0, 1] such 
that Qi — q[< l/2{k + \)Q and "^11=0 4i = 1- ^'^^ total running time is 0{max{ka/b, 1} • logQ). 
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Proof: Observe that for 1 < i < A;: 

k + 1 — i p 



Let = -22- for < i < s. It holds that for 1 < i < s: 



1 — p 



k + 1 — i p k + 1 — i a 

U = ti-i ■ ■■ • = ti-i ■ ■■ • 7 (31) 

I 1 — p I — a 



Note that for < i < s: 

H _ qo 



> Qi (32) 

Suppose now that instead of ti, we use t- > 0, < i < s, such that \ti — < ™4^(fc°_^j'pQ^ ■ Then from the 
definition of t • we get: 

maxo<j<^tj 
4(fc + l)2Q 

Also: 



We have 



t, ^ 4(fc+l)^Q 



(From [32I and that maxo<j<s tj < Ylj<s *i) — 



+ 4{fc+l)Q) ■ Sj<s *i (1 + 4(fc+l)Q ) ■ Sj<s *i 



4(fc+l)Q' 

- '^'^ " 4(A; + l)Q^ " 4(fc + l)2Q 
1 1 



So eventually we get that 



4(fc + l)g 4(A; + l)2Q-^* 2(fc + l)Q' 
< (35) 



Z,<st', - 2ik + l)Q 



T 



Also, note that X]i=o v ' f> = 1- 

Therefore, in our distribution V, we will define g' = v=^ , 

It remains to show how we obtain t'^ with the desired properties. For this purpose, we use floating-point 
arithmetic. Each positive number that we obtain during the computation is stored as a pair {S, E) repre- 
senting S ■ 2^ . We require that 2°' < S < 2"+^ and \E\ < /3, for some a and /? to be set later. If we can 
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perform all standard operations on these integers in 0(1) time, then we can perform the operations on the 
represented positive real numbers in 0(1) time as well. We call S a significand and E an exponent. 

In particular, to multiply two numbers {Si, Ei) and (S'2, E2) it suffices to multiply 5i and S2, truncate 
the least significant bits of the product, and set the new exponent accordingly. If these two numbers are 
multiplicative (1 it (5i)- and (1 it ) -approximations to some quantities Xi and X2, respectively, then the 
product of Si and 52 in our arithmetic is a multiplicative (1 ± {5i + 82 + 8162 + 2~"))-approximation to 
X1X2. If 5i < 1, then the product is a (1 it {5i + 2^2 + 2~"))-approximation. 

For each i of interest, one can easily compute a multiplicative (libC-2~")-approximation for 
in our arithmetic, where C > 1 is a constant. We make the assumption that 3CA;2" < 1, which we satisfy 
later by setting a sufficiently lai^ge a. Hence we use Equation |3T] to obtain a sequence of multiplicative 
(1 ± 3C/c2~°)-approximations t[ for tj, where < i < s. At the end, we find the maximum t[, which 
is represented as a pair {Si, Ei). For all other t'-, we no longer require that Si > 2" and we modify their 
representation {Si,Ei) so that Ei is the same as in the representation of the maximum t •. In the process we 
may lose least significant bits of the some t'^ or even all non-zero bits. Assuming again that 3Cfc2~° < 1, 
the maximum additive error |ti — t-| we get for each i for the modified representation is bounded by 

3CA;2^" • ti + 2"" • maxt'- < 3Cfc2~" • + 2 • 2~" • maxL- < (3C/c + 2) • 2~° • maxt,-, 
i i j 

where the first error term comes from the multiplicative error we obtain approximating each ti and the 
second error term comes from making all exponents in the representation match the exponent of the largest 
i-. Finally, we set a = [log((3CA: + 2) -4(^ + 1)^(5)]. This meets the previous assumption that 3CA;2~° < 1 
and the guarantee on the en^or we may make on each t'- is as desired. Note that since k and Q can be 
represented using a constant number of words, so can integers of size at most 2°''^^. To bound /?, observe 
that every • lies in the range [l/kb,kb], which imphes that all ti lie in [1/kb'' ,kb''], and the 

maximum absolute value of an exponent we need is of order 0{klog{kb)), which can be stored using a 
constant number of machine words. 

To generate a random number from V, we consider only the significands Si in the final modified repre- 
sentation of t-'s, and select each i with probabiUty Si/ J2j<s '^i ~ X]j<s ^'j- The total running time of 
the algorithm is O ( s ) . ■ 

We are ready to prove that the entire algorithm can be implemented efficiently. We use the algorithm of 
Lemma l4.1 ll for efficiently simulating Step|9]in the operation lowest. 

Theorem 4.13 Consider an algorithm A that queries the input graph only via the oracle described as 
Algorithm\J} Let t > 1 be a bound on the expected resulting number of calls in A to the oracles described 
as Algorithm\J\and Algorithm^ and such that t fits into a constant number of machine words using the 
standard binary representation. Let d be an upper bound on the maximum degree of the input graph. 

Suppose that calls to Algorithm [7] are replaced with calls to Algorithm \3\ The oracles described as 
Algorithm\3\and Algorithm^can be implemented in such a way that with probability 4/5 all of the following 
events hold: 

• The number of queries to the graph is 0{t ■ log^{dt)). 

• The total time necessary to compute the answers for the queries to the oracles is 0{t ■ log'^(dt)). 

• The distribution of the answers that the oracle gives is T> such that for some other distribution T>' over 
answers, the convex combination ^ -V + ^ - V is the distribution of answers of the oracle described 
as Algorithm |7] 
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Proof: Let = d • 0{t), where 0{t) is the bound from Lemma |48] on the number of vertices for which 
the operation lowest is called. If the event specified in Lemma 148] occurs, then is an upper bound 
on the number of edges for which the process for generating random numbers is simulated. Let 6^ = 
0{t) ■ {di, + 1) = 0{t log d), where 0{t) is the same bound as above. Then 6^ bounds the number of times 
Step|9]in Algorithm |9] is run, provided the event specified in Lemma l4T8] occurs. Let Q = 206^. 

Let = max{(i^, [log(20a2)]}. Since it is impossible to generate and store real numbers, we assign 
to edges uniform random numbers from the set : 1 < i < 2^*}, instead of the set (0, 1]. This can be 

seen as selecting a random number from (0, 1] and then rounding it up to the next multiplicity of 1/2^*. In 
particular, for every i G {1, . . . , 2^^*}, all numbers in ((i — l)/2'^*, i/2'^*] become . Observe also that 
each range Zj is a union of some number of sets ((i — l)/2'^* , i/2'^*], because Ci, > d-i,. This means that there 
is no need to modify the process for generating random numbers, except for selecting a random i/2'^* in a 
specific Ij, instead of an arbitrary real number from Ij. Observe also that as long we do not select the same 
number i/2^* twice, the entire exploration procedure behaves in the same way as in the idealized algorithm, 
since the ordering of numbers remains the same. 

Note that due to the assumption in the lemma statement, t can be represented in the standard binary form, 
using a constant number of machine words. This is also the case for d, because of the standard assumption 
that we can address all neighbors of all vertices in neighbor queries. This implies that Q = 0{tlogd) 
also has this property. Finally, the probabilities "^^{"""j^^"*"^ can easily be expressed using fractions a/b, 
where a and b are of order 0{d), and therefore, fit into a constant number of machine words as well. This 
implies that we can use the algorithm of Lemma R.l 1[ Instead of directly simulating Step|9l we proceed as 
follows. First, we run the algorithm of Lemma l4. 1 II with the error parameter Q to select a number t of edges 
in T. Then we select a random subset of edges of size t. This can be done in 0{t log d) time. 

We show that the algorithms and data structures can be implemented in such a way that the main claim 
of the theorem holds, provided the following events occur: 

• the events described in the statement of Lemma 1481 

• the rounded numbers assigned to the first edges for which the process for generating random 
numbers is simulated are different, 

• the first 6^ simulations of the algorithm described by Lemma l4.11l do not result in selecting a random 
number from the part on which the output distribution of the algorithm and the binomial distribution 
differ. 

The first of the events does not happen with probability at most 1/10. This follows from Lemma l48l 
Consider the second event. The probability that two random numbers 1/2^^* are identical is bounded by 
1/2'^* < l/(20a2). Consider the first edges for which the process generating random numbers is run. 
The expected number of pairs of the edges that have the same random number is bounded by • 1/ (20aJ) = 
1/20. By Markov's inequality, the probability that two of the edges have the same random number assigned 
is bounded by 1/20. Finally, the probability that the last event does not occur is bounded by 1/20 as well 
via the union bound. Summarizing, the events occur with probability at least 4/5. 

We now bound the running time, provided the above events occur. We assume that we use a standard 
data structure (say, balanced binary search trees) to maintain collections of items. The time necessary for 
each operation in these data structures is of order at most the logarithm of the maximum collection size. 
For instance, we keep a collection of data structures neighbor s[t'] for v that appear in our algorithm. We 
create neighbors[w] for a given v only when it is accessed for the first time. Observe that the number of 
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V for which we have to create neighbors[t;] is bounded by the query complexity 0(t log^(dt)), because 
of how we access vertices. Therefore, accessing each neighbors[i;] requires at most 0(r) time, where 
we write r to denote the logarithm of the bound on the query complexity. That is, r = 0(log t + log log d). 

The time necessary to run Algorithm [3] is bounded by 0{t), which we need to locate the data structure 
neighbor s[i;] for a given v, plus 0(1) time per each call to Algorithm |4] (we do not include the cost of 
running Algorithmic or the operation lowest here; they are analyzed later). The amount of computation 
in Algorithm [3] without the resulting calls to other procedures is bounded by 0(t • r). 

Consider now Algorithmic In every run, we first spend 0(log t) time to check if we have already com- 
puted the answer for a given edge. Then locating the data structures neighbors[ii] and neighbors[f] 
for the endpoints u and v costs at most 0(r). The running time of the reminder of the algorithm requires 
time proportional to the number of recursive calls. Therefore, the total amount of time spent executing 
Algorithm m (without calls to other procedures) is bounded by 0{t ■ r). 

We now bound the running time necessary to execute all operations of data structures neighbors. The 
initialization of neighbors[f ] (Algorithm |6|) for a given v can be done 0(1) time plus 0{t) time necessary 
for inserting the data structure into the collection of all neighbor s[w]. Overall, since at most 0{t log^{dt)) 
data structures are created, the total time necessary to initialize the data structures neighbors[v] is 0{t ■ 
log'^{dt) ■ t). Setting a value for some edge in Algorithm 13 takes at most 0{\ogd) time to insert the value 
into the mapping. This operation is run at most once for every neighbor query, so the total amount of 
computation in this procedure is 0{t ■ \o^{dt) ■ logd). So far, the total computation time is bounded by 
0{tlog'{dt)). 

Clearly, running the operation described by Algorithm |8] takes 0(1) time, so overall the total amount 
of computation in all executions of Algorithm |8] is not greater than some constant times the total amount 
of computation in the operation lowest (Algorithm |9ll. Hence it suffices to bound the total amount of 
computation in Algorithm |9j which we do next. 

Recall that Algorithm |9] is run at most 0{t) times. Therefore all operations in the loop while are run 
at most 0{tlogd) times. The total size of sets S in Step 2 is bounded by the query complexity, and dis- 
covering each element of S costs at most 0{logd) time, if the data structure assigned_number is properly 
implemented, using augmented balanced binary search trees. Therefore the total cost of running Step 2 is 
at most 0{t ■ logd + t ■ log^(dt) • logd) = 0{t ■ log'^{dt) ■ logd). In Step 3, we use the algorithm of 
Lemma 14.111 The total toll for running the algorithm is 0{t ■ log{dt)). Therefore, the total time necessary 
to simulate all executions of Step 2 is bounded by 0((t • log d + t ■ log{dt)) ■ log Q) = 0(t ■ log^(dt)). The 
total number of executions of the body of the loop foreach in Step 4 is bounded by the query complexity 
0{t ■ log^ (dt)) times 2. The time required to execute the body of the loop is dominated by the following 
two kinds of operations. One kind is querying and modifying the data structure assigned_number[tt;] 
and the data structure for S. With a proper implementation (say, augmented balanced binary search trees) 
these operations take at most 0{logd) time each. The other kind of operation is locating neighbors [it;] 
for the discovered neighbor w, which takes most 0(t) time. The total computation time for all executions 
of the loop foreach is therefore bounded by 0{t ■ log^{dt)). 

Finally sorting S never takes more than 0{\S\ log d) time, because |5| < d, and each element of S can 
be added at the end of the Ust sorted in amortized 0(1) time if the list is implemented using extendable 
arrays. This amounts to 0{t ■ log^{dt) ■ log d) in all executions of Step 11. At the end of the operation, the 
requested k^^ adjacent edge can be returned in 0(1) time. 

Summarizing, the computation of the answers of the oracles takes at most 0{t ■ log'^ {dt)) time, if all 
the desired events occur, which happens with probability at least 4/5. Note that when these events occur. 
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then also despite rounding random numbers assigned to edges, the implementation does not diverge from 
the behavior of the idealized oracle. ■ 

5 The Near-Optimal Algorithms 

Theorem 13.11 gives a bound on the expected number of recursive calls to oracles, sufficient to compute an 
answer when the vertex cover oracle is called for a random vertex. The expected number of calls is 0(p • d), 
where p is the ratio between the maximum degree d and the minimum degree (imin> and d is the average 
degree. (Recall that we assume without loss of generality that dmin > 1. For isolated vertices, the oracle 
answers that they are not in the vertex cover in 0(1) time, and therefore, it suffices to focus on the subgraph 
consisting of non-isolated vertices.) 

A straightforward application of Theorem 13.11 gives a bound of 0{d^) for graphs with maximum degree 
bounded by d. We show a bound of 0{d/e) for a modified graph, which is preferable if 1/e < d, and we also 
show how to use the modified graph to obtain an estimate for the minimum vertex cover size in the original 
input graph. We combine the obtained bound with Theorem l4. 13l to get a fast and query-efficient algorithm. 

Next we show how to obtain an efficient algorithm for the case when only the average degree of the 
input graph is bounded. Finally, we show how to adapt the algorithm to the dense graph case, when only 
vertex-pair queries are allowed. 

5.1 Bounded Maximum Degree 

As we have mentioned above, we can assume that 1/e < d. We transform our graph into one with large 
minimum degree, so that the ratio of maximum to minimum degree is small. For a given graph G = {V, E) 
with maximum degree d, consider a graph G = {V , E), such that V = V U V and E = E U E' where 
V' and E' are. defined as follows. The set V' contains a "shadow" vertex v' for each vertex v G V, and E' 
contains [ed\ parallel edges between v and v', and 8d parallel self-loops for v'. 

For a random ranking vr over E, for the output vertex cover (G) on the new graph G, we are interested 
in bounding the size of C^{G) n F as compared to VCopt(G') (the size of a minimum vertex cover of G). 
Since C'^(G) CiV is a vertex cover of G, we have that |C'^(G) n y| > VCopt(G'), and so we focus on an 
upper bound for |C*(G) n y|. 

Let F be the set of all parallel edges connecting each v with the corresponding v'. By the properties of 
the construction of C'^(G) n V, we have 

|C*(G) nV\< 2|M*(G) nE\ + |M*(G) n F| < 2VCopt(G') + |M*(G) n F|. 

Consider an arbitrary ranking vr of E. Observe that for each v G V, the matching M^{G) either includes 
a parallel edge between v and v' or it includes a self-loop incident to v'. For every v' S V, if the lowest 
rank of self-loops incident to v' is lower than the lowest rank of edges {v,v'), then M^{G) contains one 
of the self-loops, and does not contain any parallel edge {v,v'). If the ranking vr is selected uniformly 
at random, the above inequality on ranks does not hold for each vertex independently with probability at 
most ed/8d = e/8. Therefore, the expected number of edges in M'^(G) PI F is upper bounded by en/8. 
Without loss of generality, we can assume that en > 72, since otherwise we can read the entire input with 
only 0(l/e^) queries and compute a maximal matching in it. It follows from the Chernoff bound that with 
probability 1 - 1/20, |M*(G) n F| < en /A. 
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Observe that given query access to G, we can provide query access to G (in particular, the edges in E' 
that are incident to each v can be indexed starting from deg(u) + 1). Every query to G can be answered 
in 0(1) time, using 0(1) queries to G. Therefore, we can simulate an execution of the vertex-cover and the 
maximal-matching oracles on G. 

Note that the expected number of recursive calls to the maximal matching oracle is bounded for a random 
vertex v € V hy 0{d/ e), because the maximum degree and the minimum degree are within a factor of 
0(l/e). Also note that since \V\ = jV^|/2, this expectation for a random vertex u G 1/ is at most twice as 
much, i.e., it is still 0{d/e). 

For any ranking tt of edges in E, if we sample 0(l/e^) vertices from V with replacement, then the 
fraction of those in C*(G) n F is within an additive e/8 of |C*(G) n with probability at least 

1 — 1/20. Let fi be this fraction of vertices. Therefore, we have that 

VCopt(G) - en/4 < ^-n < 2VCopt(G) + en/2 

with probability at least 1 — 1/10. Thus (/i + e/4) • n is the desired (2, en)-estimate. The expected number 
of calls to the vertex cover and maximal matching oracles is bounded by 0{d/e^). Note that without loss 
of generality, e > l/4n, because any additive approximation to within an additive factor smaller than 1/4 
yields in fact the exact value. Therefore the expected number of calls to the oracles is bounded by 0(n'^), 
which can be represented with a constant number of machine words in the standard binary representation, 
using the usual assumption that we can address all vertices of the input graph. By applying now Theo- 
rem 14.131 we obtain an implementation of the algorithm. It runs in 0{d/e^ ■ log^((i/e)) time and makes 
0{d/e^ ■ log^{d/e)) queries. Moreover, the probability that the implementation diverges from the ideal 
algorithm is bounded by 1/5. Therefore, the implementation outputs a (2, en)-estimate with probability 
1 - 1/10 - 1/5 > 2/3. 

Corollary 5.1 There is an algorithm that makes 0(^ • log^ ^) neighbor and degree queries, runs in ■ 
log^ 7) time, and with probability 2/3, outputs a (2, en)-estimate to the minimum vertex cover size. 

5.2 Bounded Average Degree 

In this section, we assume an upper bound d on the average graph degree and show an efficient algorithm 
in this case0 To do this, we will transform the graph into a new graph for which the ratio of the maximum 
degree to the minimum degree is small. 

Our first transformation is to automatically add high degree vertices to the cover, and continue by finding 
a cover for the graph that is induced by the remaining vertices. Given a graph G = {V, E) with average 
degree d, let L denote the subset of vertices in G whose degree is greater than 8d/e. Hence, \L\ < en/8. 
Let E{L) denote the subset of edges in G that are incident to vertices in L, and let G = {V, E) be defined 
hy V = V \ L and E = E \ E{L), so that the maximum degree in G is at most 8d/e. For any maximal 
matching M in G we have that 

VCopt(O) < 2|M| + |L| < 2VCopt(G) + ^n . 

8 

Thus, the first modification we make to the oracles is that if the vertex-cover oracle is called on a vertex v 
such that the degree of v is greater than (4/e)J, then it immediately returns TRUE. 

*As shown in IPR07I . we don't actually need to know d for this purpose, but it suffices to get a bound that is not much higher 
than (4/e)d, and such that the number of vertices with a larger degree is 0{en), where such a bound can be obtained efficiently. 
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The remaining problem is that when we remove the high degree vertices, there are still edges incident 
to vertices with degree at most (4/e)d whose other endpoint is a high degree vertex, and this is not known 
until the appropriate neighbor query is performed. We deal with this by adding shadow vertices to replace 
the removed high degree vertices. At the same time, we increase the minimum degree as in the previous 
subsection. We now create a graph G = {V VJV ,Eyj E) a.?, follows. For every u G F, we add to F a vertex 
v' and vertices v'l, where 1 < i < \degQ{v) / d] and degg.(w) is the degree of v in G. Each of these new 
vertices has 32d/e parallel self-loops. Moreover, we add ^parallel edges between v and v' . Finally, partition 
the edges incident to w in G into \degQ{v) / d] groups, each of size at most d. The first group corresponds 
to the first d edges on the neighborhood list of v, the second group corresponds to the next d edges, and so 
on. Let Ey^i d Ehe. the set of edges in the i-th group. For every i of interest, we add |£'„^j n E{L) \ parallel 
edges between v and v'l and |£'t, j \ E{L) \ parallel self-loops incident to v'l- We add these edges so that we 
are later able to simulate every query to G using a constant number of queries to G. 

Let us bound the total number of vertices in V. The number of vertices is \ V\. The number of vertices 
v'l is bounded by 

E 

because V has been created by removing vertices with highest degrees in G, and the average degree of 
vertices in F in G cannot be greater than d, the initial average degree. This shows that \ V\ < 3\V\. 

We now repeat an ai^gument from the previous section that despite the additional edges and vertices, 
|C'^(G) n V\ is likely to be a good approximation to VCopt(G') for a random ranking vr. First, C'^(G) PI V 
is still a vertex cover for G, so VCoptlG) < |C'^(G) PI V\. Let F be the set of edges connecting all v with 
the corresponding v' and vl. We have 

|C*(G) nF| < 2\M^{G)nE\ + \M^{G)nF\ < 2VCopt(G) + |M*(G) nF|. 

Observe that if for some of the vertices in V, the lowest rank of self-loops is lower than the lowest rank 
of the pai^allel edges connecting this vertex to the corresponding vertex in V, then one of the self-loops is 
selected for the maximal matching as opposed to the parallel edges. The inequality on ranks does not hold 
with probability at most d/ {32d/e) = e/32 independently for each vertex in V. It therefore follows from 
the Chernoff bound that the number of edges in M^{G) n F is not bounded by e|y|/16 with probability at 
most exp(— e|F|/32), which is less than 1/20 if \V\ > 100/e, and we can assume that this is the case. (To 
circumvent the case of |y| < 100/e, we can modify the algorithm as follows. If a sampled vertex belongs 
to a connected component in V of size at most 100/e, then we can read its connected component in G and 
deterministically find a maximal matching that uses only edges in E and self-loops in E. This all takes at 
most 0{d/e^) time, which as we see later, we are allowed to spend per each sampled vertex.) Therefore, we 
have 

|C*(G) nF| < 2VCopt(G) + e|F|/4, 

with probability at least 1 — 1/20. 

Observe that given query access to G, we can efficiently provide query access to G. Degrees of vertices 
in V are the same as in G. For associated vertices in V it is easy to compute their degree in 0(1) time, using 
the degree of the corresponding vertex in V. To answer neighbor queries for vertices v in V, except for the 
fixed connections to v', it suffices to notice that if the corresponding edge in G is connected to a vertex in L, 
this connection is replaced by a connection to an appropriate vertex v". Otherwise, the edge remains in E. 
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For vertices v'l some number of connections can either be a connection to the con^esponding v or a self-loop. 
This can be checked in 0(1) time with a single query to the neighborhood list of v. All the other edges are 
fixed. Therefore, we can simulate an execution of the vertex-cover and the maximal-matching oracles on 
G. Answering every query to G requires 0(1) time and 0(1) queries to G. Sampling vertices uniformly 
at random from in F U F is more involved, but in our algorithm, we only need to sample vertices from V, 
which we assume we can do. 

The expected number of recursive calls to the maximal matching oracle is bounded by 0{d/e'^) for a 
random vertex v ^ V VJ V , because the maximum degree and the minimum degree are within a factor of 
0(l/e) and the maximum degree is bounded by 0{d/e). Note that since 3|y| > \V\, this expectation for a 
random vertex u G F is at most twice as much, i.e., it is still 0{d/e^). 

For any ranking vf of edges in G, if we sample 0(l/e^) vertices from V with replacement, then the 
fraction of those for which the oracle answers TRUE is within an additive eiTor e/8 of the total fraction of 
vertices for which the oracle answers TRUE, with probability 1 — 1/20. Let be the fraction of sampled 
vertices. We have 

VCopt(G) - en/8 < /i • ?i < 2VCopt(G) + en/4 + en/8 + en/8 

with probability 1 — 1/10. Then (^ + e/8)n is the desired (2, en)-estimate. The expected number of 
calls to the vertex cover and maximal matching oracles is bounded hy 0{d/ e^). As before, without loss of 
generality, this quantity can be bounded by O(n^), which fits into a constant number of machine words. By 
applying now Theorem 14. 131 we obtain an implementation of the algorithm. It runs in 0{d/e^ ■ \o^{d/e)) 
time and makes 0{d/e^ ■ log^{d/e)) queries. Moreover, the probability that the implementation diverges 
from the ideal algorithm is bounded by 1/5. Therefore, the implementation outputs a (2, en)-estimate with 
probability at least 1 - 1/5 - 1/10 > 2/3. 

Corollary 5.2 There is an algorithm that makes 0(^ • log^ ^) neighbor and degree queries, runs in ■ 
log^ 7) time, and with probability 2/3, outputs a (2, en)-estimate to the minimum vertex cover size. 

5.3 Adapting the Algorithm to the Vertex-Pair Query Model 

The focus of this paper was on designing a sublinear-time algorithm whose access to the graph is via degree 
queries and neighbor queries. In other words, we assumed that the graph was represented by adjacency 
lists (of known lengths). When a graph is dense (i.e., when the number of edges is B(n^)), then a natural 
alternative representation is by an adjacency matrix. This representation supports queries of the form: "Is 
there an edge between vertex u and vertex vT, which we refer to as vertex-pair queries. 

We next show how to adapt the algorithm described in the previous section to an algorithm that per- 
forms vertex-pair queries. The query complexity and running time of the algorithm are (with high constant 
probability) 0(n/e^), which is linear in the average degree for dense graphs. As in the previous section, the 
algorithm outputs (with high constant probability) a (2, e)-estimate of the size of the minimum vertex cover. 
We recall that the linear lower bound in the average degree ltPR07ll also holds for the case that the average 
degree is 0(n) and when vertex-pair queries ai^e allowed. 

Given a graph G = {V,E), let G = {V,E) be a supergraph of G that is defined as follows. For 
every vertex v ^ V whose degree in G is less tharfl n, there exists a vertex v' G V, where there are 

' If there are no self-loops in the original graph, then the bound is n — 1. 
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n — degQ{v) parallel edges between v and v' , and there are (8/e)n self-loops incident to v' . As shown in 
Subsection 15.11 with high probability over the choice of a ranking tt over G, we have that \C'"{G) r\V\ < 
2VCopt(G) + (e/4)n. 

Note that we can emulate neighbor queries to G given access to vertex-pair queries in G as follows. Let 
the vertices in G be {1, ... , n}. When the j*^ neighbor of vertex i is queried, then the answer to the query 
is j when (i,j) G E, and it is i' (the new auxiliary vertex adjacent to i) when (z, j) ^ E. The degree of 
every vertex in V is n, so there is no need to perform degree queries. Since the ratio between the maximum 
degree and the minimum degree in G is at most 1/e, and the maximum and average degrees are 0{n/e), we 
obtain an algorithm whose complexity is 0(n/e^), as claimed. 
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